Abstract

Consider the barycentric subdivision which cuts a given triangle along its medians to produce
six new triangles. Uniformly choosing one of them and iterating this procedure gives rise to a
Markov chain. We show that almost surely, the triangles forming this chain become flatter and
flatter in the sense that their isoperimetric values go to infinity with time. Nevertheless, if the
triangles are renormalized through a similitude to have their longest edge equal to $[0,1] \subset \mathbb{C}$ (with
$0$ also adjacent to the shortest edge), their aspect does not converge and we identify the limit
set of the opposite vertex with the segment $[0,1/2]$. In addition we prove that the largest angle
converges to $\pi$ in probability. Our approach is probabilistic and these results are deduced from the
investigation of a limit iterated random function Markov chain living on the segment $[0,1/2]$. The
stationary distribution of this limit chain is particularly important in our study. In an appendix
we present related numerical simulations (not included in the version submitted for publication).
1 Introduction

Let $\Delta$ be a given triangle in the plane (to avoid triviality, the vertices will always be assumed not
to be all the same). The three medians of $\Delta$ intersect at the barycenter; this cuts it into six small
triangles, say $\Delta_1, \Delta_2, \Delta_3, \Delta_4, \Delta_5, \Delta_6$. Next, each $\Delta_i$, for $i \in [\![1,6]\!]$ (which denotes the set
$\{1, 2, ..., 6\}$), can itself be subdivided in the same way into six triangles, $(\Delta_{i,j})_{j \in [\![1,6]\!]}$. Iterating this
barycentric subdivision procedure, we get $6^n$ triangles $(\Delta_I)_{I \in [\![1,6]\!]^n}$ at stage $n \in \mathbb{N}$. It is well-known
numerically (we learned it from Blackwell [2], see also the survey by Butler and Graham [3]) and
has been recently proved (cf. Diaconis and McMullen [6] and Hough [8]) that as the barycentric
subdivision goes on, most of the triangles become flat. The original motivation for this kind of
result was to show that the barycentric subdivision is not a good procedure to construct nice
triangulations of surfaces. For more information on other kinds of triangle subdivisions, we
refer to a recent manuscript of Butler and Graham [3]. The goal of this paper is to propose a new
probabilistic approach to this phenomenon.

First, we adopt a Markovian point of view: let $\Delta(0) := \Delta$ and throw a fair die to choose
$\Delta(1)$ among the six triangles $\Delta_i$, $i \in [\![1,6]\!]$. Continuing in the same way, we get a Markov chain
$(\Delta(n))_{n \in \mathbb{N}}$: if the first $n$ triangles have been constructed, the next one is obtained by choosing
uniformly (and independently from what was done before) one of the six triangles of the barycentric
subdivision of the last obtained triangle. Of course, at any time $n \in \mathbb{N}^*$ ($\mathbb{N}^*$ stands for $\mathbb{N} \setminus \{0\}$),
the law of $\Delta(n)$ is the uniform distribution on the set of triangles $\{\Delta_I : I \in [\![1,6]\!]^n\}$. So to deduce
generic properties under this distribution it is sufficient to study the chain $(\Delta(n))_{n \in \mathbb{N}}$.
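
The construction of the chain can be sketched in a few lines of Python. This is only an illustration under our own conventions: the names `subdivide`, `area` and `chain` are hypothetical, triangles are stored as triples of coordinate pairs, and floating point numbers are used.

```python
import random

def subdivide(tri):
    """Return the six triangles cut out by the three medians of tri."""
    a, b, c = tri
    mid = lambda p, q: ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)
    d, e, f = mid(a, b), mid(b, c), mid(a, c)            # edge midpoints
    g = ((a[0] + b[0] + c[0]) / 3, (a[1] + b[1] + c[1]) / 3)  # barycenter
    return [(a, d, g), (d, b, g), (b, e, g), (e, c, g), (c, f, g), (f, a, g)]

def area(tri):
    (x1, y1), (x2, y2), (x3, y3) = tri
    return abs((x2 - x1) * (y3 - y1) - (x3 - x1) * (y2 - y1)) / 2

def chain(tri, n, rng):
    """Return [Delta(0), ..., Delta(n)], throwing a fair die at each step."""
    path = [tri]
    for _ in range(n):
        path.append(rng.choice(subdivide(path[-1])))
    return path

start = ((0.0, 0.0), (0.3, 0.4), (1.0, 0.0))
path = chain(start, 5, random.Random(0))
```

Each of the six children has one sixth of the area of its parent, so after $n$ steps the area has been divided by $6^n$ whatever the choices made.
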

In order to describe our results more analytically, let us renormalize the triangles. For any non-trivial
triangle $\Delta$ in the plane, there is a similitude of the plane transforming $\Delta$ into a triangle
whose vertices are $(0,0)$, $(1,0)$ and $(x,y) \in [0,1/2] \times [0,\sqrt{3}/2]$, such that the longest (respectively
the shortest) edge of $\Delta$ is sent to $[(0,0), (1,0)]$ (resp. $[(0,0), (x,y)]$). The point $(x,y)$ is uniquely
determined and characterizes the aspect of $\Delta$ (as long as orientation is not considered, otherwise
we would have to consider positive similitudes and $x$ would have to belong to $[0,1]$). Any time
we are interested in quantities which are invariant by similitude, we will identify triangles with
their characterizing points. In particular, this identification will endow the set of triangles with the
topology (not separating triangles with the same aspect) inherited from the usual topology of the
plane. This convention will implicitly be enforced throughout this paper. The triangle $\Delta$ will be
said to be flat if $y = 0$. So up to similitude the set of flat triangles can be identified with $[0, 1/2]$.
For $n \in \mathbb{N}$, let $(X_n, Y_n)$ be the characterizing point of $\Delta(n)$. The first result justifies the assertion
that as the barycentric subdivision goes on, the triangles become flat.
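
As an illustration of this normalization, the following sketch computes the characterizing point from the three squared edge lengths only. It assumes the longest edge is sent to the horizontal unit segment from $(0,0)$ to $(1,0)$, as in section 2; the function name `characterizing_point` is ours.

```python
import math

def characterizing_point(tri):
    """Return (x, y) with x in [0, 1/2], y >= 0, for a non-trivial triangle."""
    a, b, c = tri
    sq = lambda p, q: (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2
    # squared lengths of the shortest, middle and longest edges
    s1, s2, s3 = sorted([sq(a, b), sq(b, c), sq(c, a)])
    # after rescaling by the longest edge: s1/s3 = x^2 + y^2 and
    # s2/s3 = (1 - x)^2 + y^2, hence the two formulas below
    x = (s3 + s1 - s2) / (2 * s3)
    y = math.sqrt(max(s1 / s3 - x * x, 0.0))  # clip rounding noise for flat triangles
    return x, y
```

For instance the right triangle with legs of lengths 1 and 2 has characterizing point $(0.2, 0.4)$, and any equilateral triangle is sent to $(1/2, \sqrt{3}/2)$.
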

Theorem 1 Almost surely (a.s.) the stochastic sequence $(Y_n)_{n \in \mathbb{N}}$ converges to zero exponentially
fast: there exists a constant $\chi > 0$ such that a.s.:

$$\limsup_{n \to \infty} \frac{1}{n} \ln(Y_n) \le -\chi$$

We will show that we can take $\chi = 0.035$ (but this is not the best constant, indeed we will present
a numerical experiment suggesting that the above bound should hold with $\chi \approx 0.07$); nevertheless
the previous result remains asymptotic. Contrary to Blackwell [2] (see also the remark at
the end of section 6), we have not been able to deduce a more quantitative bound in probability
on $Y_n$ for any given $n \in \mathbb{N}$.

In particular, we recover the convergence in probability toward the set of flat triangles which
was previously proved by Diaconis and McMullen [6] (using Furstenberg's theorem on products
of random matrices in $SL_2(\mathbb{R})$) and Hough [8], who used dynamical systems arguments (via an
identification with a random walk on $SL_2(\mathbb{R})$).

There is a stronger notion of convergence to flatness that asks for the triangles to have an angle
which is almost equal to $\pi$. With the preceding notation, for $n \in \mathbb{N}$, let $A_n$ be the angle between
$[(0,0), (X_n,Y_n)]$ and $[(X_n,Y_n), (1,0)]$; this is the largest angle of $\Delta(n)$.



Theorem 2 The sequence $(\Delta(n))_{n \in \mathbb{N}}$ becomes strongly flat in probability:

$$\forall\, \epsilon > 0, \qquad \lim_{n \to \infty} \mathbb{P}[A_n < \pi - \epsilon] = 0$$

Of course this result implies that $(Y_n)_{n \in \mathbb{N}}$ converges to zero in probability. Note that the
converse is not true in general: there are isosceles triangles that become flatter and flatter, but their
maximum angle converges to $\pi/2$. Indeed, Theorem 2 is more difficult to obtain than Theorem 1
because $(X_n)_{n \in \mathbb{N}}$ does not converge, as the next result shows. Define the limit set of this sequence
as the intersection over $p \in \mathbb{N}$ of the closures of the sets $\{X_n : n \ge p\}$.

Theorem 3 Almost surely, the limit set of $(X_n)_{n \in \mathbb{N}}$ is $[0, 1/2]$.

It follows from Theorem 1 that a.s. the limit set of a trajectory of the triangle Markov chain
$(\Delta(n))_{n \in \mathbb{N}}$ is the whole set of flat triangles.

A crucial tool behind these results is a limiting flat Markov chain $Z$. Strictly speaking the
stochastic chain $(X_n)_{n \in \mathbb{N}}$ is not Markovian, but eventually its evolution becomes almost Markovian.
Indeed, we note that the above barycentric subdivision procedure can formally also be applied to
flat triangles and their set is stable by this operation. This means that if $Y_0 = 0$, then for any
$n \in \mathbb{N}$, $Y_n = 0$ a.s. In this particular situation $(X_n)_{n \in \mathbb{N}}$ is Markovian. Let $M$ be its transition
kernel, from $[0, 1/2]$ to itself. In what follows, $Z := (Z_n)_{n \in \mathbb{N}}$ will always designate a Markov chain
on $[0, 1/2]$ whose transition kernel is $M$. An important part of this paper will be devoted to the
investigation of the Markov chain $Z$ since it is the key to the above asymptotic behaviour. We will
see that $Z$ is ergodic in the sense that it admits an attracting (and thus unique) invariant measure
$\mu$ on $[0, 1/2]$. We will also show that $\mu$ is continuous and that its support is $[0, 1/2]$ (but we don't
know if $\mu$ is absolutely continuous).

The plan of the paper is the following: the next section contains some global preliminaries, in
particular we will show, by studying the evolution of a convenient variant of the isoperimetric value,
that the triangle Markov chain returns as close as we want to the set of flat triangles infinitely
often. This is a first step in the direction of Theorem 1. In section 3, we begin our investigation of
the limiting Markov chain $Z$, to obtain some information valid in a neighborhood of the set of flat
triangles. Then in section 4 we put together the previous global and local results to prove Theorem
1. Ergodicity and the attracting invariant measure $\mu$ of the Markov chain $Z$ are studied in section
5, using results of Dubins and Freedman [7], Barnsley and Elton [1] and Diaconis and Freedman [5]
on iterated random functions. This will lead to the proofs of Theorem 2 and Theorem 3 in section 6.
In an appendix we present some numerical experiments which helped us in some technical proofs.
They will also illustrate the obtained results, in particular to evaluate the constant $\chi$ of Theorem 1
and to see the profile of $\mu$.

2 A weak result on attraction to flatness 

The purpose of this section is to give some preliminary information and bounds on the triangle
Markov chain obtained by barycentric subdivisions. By themselves, these results are not sufficient
to conclude the a.s. convergence toward the set of flat triangles, but at least they give a heuristic
hint for this behaviour: a quantity comparable to the isoperimetric value of the triangle has a
tendency to increase after barycentric subdivision and so to diverge to infinity with time, in the
mean.

To measure the separation between a given triangle $\Delta$ and the set of flat triangles, we use the
quantity $J(\Delta)$, which is the sum of the squares of the lengths of the edges divided by the area
(this is well-defined in $(0, +\infty]$, since the vertices are assumed not to be all the same). We have
$J(\Delta) = +\infty$ if and only if $\Delta$ is flat. Furthermore the functional $J$ is invariant under similitude,
so it depends only on the characteristic point $(x, y)$ of $\Delta$ and we have

$$J(\Delta) = \frac{2\left(1 + x^2 + y^2 + (1-x)^2 + y^2\right)}{y} = \frac{4\left(x^2 - x + 1 + y^2\right)}{y}$$

in particular we get

$$\frac{y}{8} \le (J(\Delta))^{-1} \le \frac{y}{3} \tag{1}$$

so that the convergence of $y$ to zero is equivalent to the divergence of $J(\Delta)$ to $+\infty$. Note that
$J(\Delta)$ is comparable with the isoperimetric value $I(\Delta)$ of $\Delta$, defined as the square of the perimeter
of $\Delta$ divided by its area:

$$\frac{1}{3} I(\Delta) \le J(\Delta) \le I(\Delta) \tag{2}$$

With the notation of the introduction, write for $n \in \mathbb{N}$, $J_n := J(\Delta(n))$. Our first goal is to show

Proposition 4 Almost surely, we have $\limsup_{n \to \infty} J_n = +\infty$.

The proof will be based on elementary considerations of one step of the barycentric subdivision.
Consider $\Delta$ a triangle in the normalized form given in the introduction. For simplicity, we denote by
$A$, $B$ and $C$ the vertices $(0,0)$, $(x,y)$ and $(1,0)$ of $\Delta$. Let also $D$, $E$, $F$ and $G$ be respectively the
middle points of $[A, B]$, $[B, C]$ and $[A, C]$ and the barycenter of $\Delta$. We index the small triangles
obtained by the barycentric subdivision as

$$\begin{aligned}
\Delta_1 &:= \{A,D,G\}, & \Delta_2 &:= \{D,B,G\}, & \Delta_3 &:= \{B,E,G\} \\
\Delta_4 &:= \{E,C,G\}, & \Delta_5 &:= \{C,F,G\}, & \Delta_6 &:= \{F,A,G\}
\end{aligned} \tag{3}$$

It is well-known that all the triangles $\Delta_i$, for $i \in [\![1,6]\!]$, have the same area (one straightforward
way to see it is to note that this property is invariant by affine transformations and to consider
the equilateral case).

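Next define, with $|\cdot|$ denoting the length,

$$L_1 := |[A,B]|, \quad L_2 := |[B,C]|, \quad L_3 := |[C,A]|$$
$$l_1 := |[D,C]|, \quad l_2 := |[E,A]|, \quad l_3 := |[F,B]|$$

An immediate computation gives that

$$l_1^2 = \frac{x^2}{4} + \frac{y^2}{4} - x + 1, \qquad l_2^2 = \frac{x^2}{4} + \frac{y^2}{4} + \frac{x}{2} + \frac{1}{4}, \qquad l_3^2 = x^2 + y^2 - x + \frac{1}{4} \tag{4}$$

so we get

$$\frac{l_1^2 + l_2^2 + l_3^2}{L_1^2 + L_2^2 + L_3^2} = \frac{3}{4}$$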

These ingredients imply the following probabilistic statement:

Lemma 5 For any $n \in \mathbb{N}$, we have

$$\mathbb{E}[J_{n+1} \mid \mathcal{T}_n] = \frac{4}{3} J_n$$

where the lhs is a conditional expectation with respect to $\mathcal{T}_n$, the $\sigma$-algebra generated by $\Delta(n)$,
$\Delta(n-1)$, ..., $\Delta(0)$.



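Proof

By the Markov property, the above identity is equivalent to the fact that for any $n \in \mathbb{N}$,

$$\mathbb{E}[J_{n+1} \mid \Delta(n)] = \frac{4}{3} J_n$$

Since the Markov chain $(\Delta(n))_{n \in \mathbb{N}}$ is time-homogeneous, it is sufficient to deal with the case $n = 0$.
We come back to the notation introduced above. Because the small triangles have the same area
and the barycenter cuts the median segments into a ratio $(1/3, 2/3)$, we get that

$$\begin{aligned}
\mathbb{E}[J(\Delta(1)) \mid \Delta(0) = \Delta] &= \frac{1}{6} \sum_{i \in [\![1,6]\!]} J(\Delta_i) \\
&= \frac{1}{\mathcal{A}(\Delta)} \left( \frac{1}{2}\left(L_1^2 + L_2^2 + L_3^2\right) + 2\left(\frac{1}{9} + \frac{4}{9}\right)\left(l_1^2 + l_2^2 + l_3^2\right) \right) \\
&= \frac{1}{\mathcal{A}(\Delta)} \left( \frac{1}{2} + \frac{10}{9} \cdot \frac{3}{4} \right)\left(L_1^2 + L_2^2 + L_3^2\right) \\
&= \frac{4}{3} J(\Delta)
\end{aligned}$$

where $\mathcal{A}(\Delta)$ is the area of $\Delta$.

■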
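
The identity of Lemma 5 can be checked exactly on an example with rational arithmetic. The sketch below is only a sanity check: the helper names `subdivide` and `J` are ours, and the starting triangle is an arbitrary choice.

```python
from fractions import Fraction

def subdivide(tri):
    """Six triangles of the barycentric subdivision (exact with Fractions)."""
    a, b, c = tri
    mid = lambda p, q: ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)
    d, e, f = mid(a, b), mid(b, c), mid(a, c)
    g = ((a[0] + b[0] + c[0]) / 3, (a[1] + b[1] + c[1]) / 3)
    return [(a, d, g), (d, b, g), (b, e, g), (e, c, g), (c, f, g), (f, a, g)]

def J(tri):
    """Sum of the squared edge lengths divided by the area."""
    a, b, c = tri
    sq = lambda p, q: (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2
    twice_area = abs((b[0] - a[0]) * (c[1] - a[1]) - (c[0] - a[0]) * (b[1] - a[1]))
    return 2 * (sq(a, b) + sq(b, c) + sq(c, a)) / twice_area

# normalized triangle with characterizing point (3/10, 2/5)
tri = ((Fraction(0), Fraction(0)), (Fraction(3, 10), Fraction(2, 5)), (Fraction(1), Fraction(0)))
mean_child_J = sum(J(t) for t in subdivide(tri)) / 6
```

Here `J(tri)` equals 19/2 and `mean_child_J` equals 38/3, which is exactly 4/3 times 19/2.
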

In general the previous submartingale information is not enough to deduce a.s. convergence.
Taking expectations, we get that for any $n \in \mathbb{N}$, $\mathbb{E}[J_{n+1}] \ge (4/3)\,\mathbb{E}[J_n]$, thus $\mathbb{E}[J_n] \ge (4/3)^n J(\Delta)$,
so we can just deduce $L^1$-divergence of $J_n$ for large $n \in \mathbb{N}$, but this is not a very useful result.

To prove Proposition 4, note that the numbers $J_n$, $n \in \mathbb{N}$, are uniformly bounded below by a
positive constant. Indeed the isoperimetric functional can be defined for relatively general bounded
subsets of the plane and its minimal value is obtained for discs (cf. for instance Osserman [9]).
Thus, via (2), we get

$$\forall\, n \in \mathbb{N}, \qquad J_n \ge \frac{4}{3}\pi > 4$$

But from Lemma 5 we see that (at least one of the six equally likely children has a value of $J$ no
smaller than the mean)

$$\forall\, n \in \mathbb{N}, \qquad \mathbb{P}[J_{n+1} \ge (4/3) J_n \mid \mathcal{T}_n] \ge \frac{1}{6}$$

and consequently

$$\forall\, n, m \in \mathbb{N}, \qquad \mathbb{P}[J_{n+m} \ge (4/3)^m\, 4 \mid \mathcal{T}_n] \ge \frac{1}{6^m} \tag{5}$$

Let $R > 1$ be an arbitrary large number and consider $m \in \mathbb{N}^*$ such that $(4/3)^m\, 4 \ge R$. The $\{0, 1\}$-valued
sequence $(\mathbb{1}_{J_{m(n+1)} \ge R})_{n \in \mathbb{N}}$ stochastically dominates a sequence of independent Bernoulli
variables of parameter $1/6^m$. It follows that a.s. we have

$$\limsup_{n \to \infty} J_n \ge R$$

and since $R$ can be chosen arbitrarily large, Proposition 4 is proven.

■
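
The divergence of $J_n$ can be observed on a simulated trajectory. The sketch below follows one trajectory with exact rational coordinates, so that nearly flat triangles do not suffer from rounding; the helper names, the starting triangle, the seed and the horizon are illustrative choices of ours, not taken from the paper.

```python
import math
import random
from fractions import Fraction

def subdivide(tri):
    a, b, c = tri
    mid = lambda p, q: ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)
    d, e, f = mid(a, b), mid(b, c), mid(a, c)
    g = ((a[0] + b[0] + c[0]) / 3, (a[1] + b[1] + c[1]) / 3)
    return [(a, d, g), (d, b, g), (b, e, g), (e, c, g), (c, f, g), (f, a, g)]

def J(tri):
    a, b, c = tri
    sq = lambda p, q: (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2
    twice_area = abs((b[0] - a[0]) * (c[1] - a[1]) - (c[0] - a[0]) * (b[1] - a[1]))
    return 2 * (sq(a, b) + sq(b, c) + sq(c, a)) / twice_area

def log_J_path(tri, n, rng):
    """Return [ln J_0, ..., ln J_n] along one trajectory of the chain."""
    logs = []
    for _ in range(n + 1):
        logs.append(math.log(J(tri)))  # the Fraction is converted to float here
        tri = rng.choice(subdivide(tri))
    return logs

start = tuple((Fraction(p), Fraction(q)) for p, q in [(0, 0), ("3/10", "2/5"), (1, 0)])
logs = log_J_path(start, 800, random.Random(1))
rate = (logs[-1] - logs[0]) / 800  # empirical growth rate of ln J_n
```

A single trajectory of this length is only indicative, but the empirical rate is typically clearly positive.
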

To finish this section, we will prove another simple preliminary result. 

Lemma 6 There exist two constants $0 < a < b < +\infty$ such that

$$\forall\, n \in \mathbb{N}, \qquad a J_n \le J_{n+1} \le b J_n$$



Proof 

Again it is sufficient to consider the first barycentric subdivision and to prove that we can find two
constants $0 < a < b < +\infty$ such that with the above notation,

$$\forall\, i \in [\![1,6]\!], \qquad a J(\Delta) \le J(\Delta_i) \le b J(\Delta)$$

Such inequalities are obvious for flat triangles, so assume that $\Delta$ is not flat. Since the areas
are easy to compare, we just need to consider the diameters (whose squares are comparable,
within the range $[1,3]$, with the sums of the squares of the lengths of the edges), denoted by $d$.
We have clearly $d(\Delta) = 1$ and $d(\Delta_i) \le 1$ for $i \in [\![1,6]\!]$. The reverse bound $d(\Delta_i) \ge 1/4$, for
$i \in [\![1,6]\!]$, is a consequence of the relations $|[A,F]| = |[F,C]| = 1/2$, $|[B,E]| = |[E,C]| \ge 1/4$ and
$|[D,G]| = |[G,C]|/2 \ge 1/4$.



3 Near the limit flat Markov chain 

Our goal here is two-fold. First we show that the kernel of the triangle Markov chain converges 
nicely to the kernel of the flat triangle Markov chain as the triangle becomes flat. Second we study 
the evolution of a perimeter related functional for the flat triangle Markov chain, to get a bound 
on the evolution of the isoperimetric functional for the triangle Markov chain, valid at least in a 
neighborhood of the set of flat triangles. 

Let $Q$ be the transition kernel of the Markov chain $(X_n, Y_n)_{n \in \mathbb{N}}$ considered in the introduction.
For any $(x, y) \in \mathcal{D}$, the set of characterizing points of triangles, we can write

$$Q((x,y), \cdot) = \frac{1}{6} \sum_{i \in [\![1,6]\!]} \delta_{(x_i, y_i)}$$

where $\delta$ stands for the Dirac mass and where for any $i \in [\![1,6]\!]$, $(x_i, y_i)$ is the characterizing point
of the triangle $\Delta_i$ described in (3). Of course, the $x_i$ and $y_i$, for $i \in [\![1,6]\!]$, have to be seen as
functions of $(x,y)$. For $i \in [\![1,6]\!]$, let us define

$$\forall\, x \in [0,1/2], \qquad z_i(x) := x_i(x,0) \tag{6}$$

The transition kernel $M$ on $[0, 1/2]$ of the flat triangle Markov chain alluded to in the introduction
can be expressed as

$$\forall\, x \in [0,1/2], \qquad M(x, \cdot) = \frac{1}{6} \sum_{i \in [\![1,6]\!]} \delta_{z_i(x)} \tag{7}$$

The next result gives bounds on the discrepancy between Q and M as the triangles become flat. 

Lemma 7 There exists a constant $K > 0$ such that

$$\forall\, i \in [\![1,6]\!],\ \forall\, (x,y) \in \mathcal{D}, \qquad \max\left(|x_i(x,y) - z_i(x)|,\ |y_i(x,y)|\right) \le K y$$

Proof 

We first check that for any fixed $i \in [\![1,6]\!]$, the mapping

$$\mathcal{D}_2 \ni (x, y^2) \mapsto (x_i(x,y), y_i^2(x,y)) \in \mathcal{D}_2 \tag{8}$$

is (uniformly) Lipschitz, where $\mathcal{D}_2$ is the image of $\mathcal{D}$ through $\mathcal{D} \ni (x,y) \mapsto (x,y^2) \in \mathcal{D}_2$.

Indeed, denote by $0 < L_{i,1} \le L_{i,2} \le L_{i,3}$ the ordered lengths of the triangle $\Delta_i$. We have seen
in the previous section that

$$y_i^2 = \frac{L_{i,1}^2 + L_{i,2}^2 + L_{i,3}^2}{2 L_{i,3}^2} - x_i^2 + x_i - 1 \tag{9}$$

Let $h_i$ be the height of $\Delta_i$ orthogonal to the edge of length $L_{i,3}$; we have $L_{i,1}^2 = h_i^2 + (x_i L_{i,3})^2$ and
$L_{i,2}^2 = h_i^2 + ((1 - x_i) L_{i,3})^2$. It follows that

$$x_i = \frac{L_{i,3}^2 + L_{i,1}^2 - L_{i,2}^2}{2 L_{i,3}^2} \tag{10}$$

Finally, notice that (4) implies that the mappings $\mathcal{D}_2 \ni (x,y^2) \mapsto L_{i,j}^2$, for $j \in [\![1,3]\!]$, are uniformly
Lipschitz. Furthermore, as seen in the proof of Lemma 6, the mapping $\mathcal{D}_2 \ni (x,y^2) \mapsto L_{i,3}^2$ is
bounded below by $1/16$, so (10) and (9) imply that the mapping described in (8) is uniformly
Lipschitz.

The bounds given in Lemma 7 are an easy consequence of this Lipschitz property and of the
boundedness of $\mathcal{D}$.

■

The second goal of this section is to study the sign of quantities like $\mathbb{E}[\ln(I_{n+1}/I_n) \mid \Delta(n) = \Delta]$,
at least when $\Delta$ is close to a flat triangle. Here we define $I_n$ as the isoperimetric value of $\Delta(n)$. By
the Markov property, this amounts to evaluating the sign of $\frac{1}{6} \sum_{i \in [\![1,6]\!]} \ln(I(\Delta_i)/I(\Delta))$. Of course
the previous ratios are not rigorously defined if the triangle $\Delta$ is flat. Nevertheless, let $(x, y)$ be
the characterizing point of $\Delta$. When $y$ goes to $0_+$, $I(\Delta_i)/I(\Delta) = \sqrt{6}\, \mathcal{P}(\Delta_i)/\mathcal{P}(\Delta)$ (where $\mathcal{P}$ denotes the perimeter) converges
to $G(i,x)$, which is just the same ratio for the flat triangle $\Delta$ whose characterizing point is $(x,0)$.
We have, for any $x \in [0, 1/2]$ (see the computations of section 5 for more details)

$$\begin{aligned}
G(1,x) &= \sqrt{\tfrac{2}{3}}\,(1 + x), & G(2,x) &= \sqrt{\tfrac{1}{6}}\,(2 - x) \\
G(3,x) &= \sqrt{\tfrac{3}{2}}\,(1 - x), & G(4,x) &= \sqrt{\tfrac{2}{3}}\,(2 - x) \\
G(5,x) &= \sqrt{\tfrac{2}{3}}\,(2 - x), & G(6,x) &= \sqrt{\tfrac{3}{2}}
\end{aligned} \tag{11}$$

From the previous considerations, we easily get that this convergence is uniform over $x$, in the
sense that for any $i \in [\![1, 6]\!]$,

$$\lim_{y \to 0_+} \sup_{x \in [0,1/2]} \left| \frac{I(\Delta_i)}{I(\Delta)} - G(i,x) \right| = 0$$

So to prove that $\mathbb{E}[\ln(I_{n+1}/I_n) \mid \Delta(n) = \Delta] > 0$ for nearly flat triangles $\Delta$, it is sufficient to show
that the mapping $[0, 1/2] \ni x \mapsto \sum_{i \in [\![1,6]\!]} \ln(G(i,x))$ only takes positive values. Unfortunately, this
is not true, since it takes negative values in a neighborhood of $1/2$ (see section 7). To get around
this problem, we iterate the barycentric subdivision one more step.

Proposition 8 There exist a constant $\gamma > 0$ and a neighborhood $\mathcal{N}$ of the set of the flat triangles,
such that

$$\forall\, n \in \mathbb{N},\ \forall\, \Delta \in \mathcal{N}, \qquad \mathbb{E}[\ln(I_{n+2}/I_n) \mid \Delta(n) = \Delta] \ge \gamma$$

(for flat triangles $\Delta$, the ratio is defined as a limit as above, or equivalently, as a ratio of perimeters,
before renormalisation, up to the factor 6).



Proof 

Coming back to the notation at the beginning of the introduction, we want to find $\mathcal{N}$ and $\gamma$ as
above and satisfying

$$\forall\, \Delta \in \mathcal{N}, \qquad \frac{1}{36} \sum_{i,j \in [\![1,6]\!]} \ln\left(\frac{I(\Delta_{i,j})}{I(\Delta)}\right) \ge \gamma \tag{12}$$

Let $(x, y)$ be the characterizing point of $\Delta$. As $y$ goes to $0_+$, the left hand side converges (uniformly
over $x$) to

$$F(x) := \frac{1}{36} \sum_{i,j \in [\![1,6]\!]} \ln\left(G(j, z_i(x))\, G(i,x)\right) \tag{13}$$

where the $z_i(x)$, for $i \in [\![1,6]\!]$, were defined in (6). More explicitly, we will compute in section 5 (see
Lemma 12) that on each of the segments $[0, 1/5]$, $[1/5, 2/7]$ and $[2/7, 1/2]$, the $z_i$, for $i \in [\![1,6]\!]$, are
homographical mappings. So it seems more convenient to consider the piecewise rational fraction

$$R(x) := \exp(36 F(x)) = \prod_{i,j \in [\![1,6]\!]} G(j, z_i(x))\, G(i,x) \tag{14}$$

After computations (see section 7), it appears that this is indeed a piecewise polynomial function.
By numerically studying the zeroes of $R - 1$ for the three underlying polynomial functions, we show
that $F$ does not vanish on $[0, 1/2]$. So by continuity, we get that $\gamma := \min_{[0,1/2]} F/2 > 0$. Then
using the above uniform convergence, we can find a neighborhood $\mathcal{N}$ of the set of flat triangles so
that (12) is fulfilled.



We will see more precisely in section 7 that $F$ is decreasing, so we can take $\gamma = F(1/2)/2 \approx 0.035$.
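
The value of this constant can be reproduced numerically. The sketch below evaluates the two-step average $F$ (the mean over the 36 pairs $(i,j)$ of $\ln(G(j, z_i(x))\,G(i,x))$) from closed forms for $G$ and for the flat maps $z_i$. These closed forms are our own derivation from the flat geometry (ratio of the shortest edge to the longest edge for $z_i$, $\sqrt{6}$ times a ratio of perimeters for $G$) and should be read as assumptions, as should the function names.

```python
import math

def G(i, x):
    s = math.sqrt
    return (s(2 / 3) * (1 + x), (2 - x) / s(6), s(3 / 2) * (1 - x),
            s(2 / 3) * (2 - x), s(2 / 3) * (2 - x), s(3 / 2))[i - 1]

def z(i, x):
    """Characterizing abscissa of child i of the flat triangle (x, 0)."""
    return (3 * x / (2 * (1 + x)),
            min(3 * x, 2 * (1 - 2 * x)) / (2 - x),
            min(1 + x, 2 * (1 - 2 * x)) / (3 * (1 - x)),
            (1 + x) / (2 * (2 - x)),
            (1 - 2 * x) / (2 * (2 - x)),
            (1 - 2 * x) / 3)[i - 1]

def F(x):
    total = 0.0
    for i in range(1, 7):
        for j in range(1, 7):
            total += math.log(G(j, z(i, x)) * G(i, x))
    return total / 36

gamma = F(0.5) / 2  # about 0.0356
```

On a grid of $[0, 1/2]$ the minimum of `F` is attained at $x = 1/2$, in agreement with the monotonicity claimed above.
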

4 Almost sure convergence to flatness 

We are now in position to prove Theorem 1. The principle behind the proof is that there is
a neighborhood $\mathcal{N}'$ of the set of flat triangles such that if the triangle Markov chain is inside
$\mathcal{N}'$, then it has a positive probability to always stay in this neighborhood and then to converge
exponentially fast to the set of flat triangles. This event will eventually occur, since triangle Markov
chains always return to $\mathcal{N}'$.

In order to see that the triangle Markov chain has a positive chance to remain trapped in a
neighborhood of the set of flat triangles, we will use a general martingale argument. To do so,
we introduce some notation. On some underlying probability space, let $(\mathcal{F}_n)_{n \in \mathbb{N}}$ be a filtration,
namely a non-decreasing sequence of $\sigma$-algebras. Let $\gamma > 0$ and $A > 0$ be two given constants.
We assume that for any $R$ large enough, say $R \ge R_0 > 0$, we are given a chain $(V_n^{(R)})_{n \in \mathbb{N}}$ and a
martingale $(N_n^{(R)})_{n \in \mathbb{N}}$, adapted to the filtration $(\mathcal{F}_n)_{n \in \mathbb{N}}$, satisfying $V_0^{(R)} = R$, $N_0^{(R)} = 0$ and such
that for any time $n \in \mathbb{N}$,

$$\left|N_{n+1}^{(R)} - N_n^{(R)}\right| \le A \tag{15}$$

$$V_{n+1}^{(R)} - V_n^{(R)} \ge \gamma + N_{n+1}^{(R)} - N_n^{(R)} \tag{16}$$

The next result shows that if $R$ is large enough, with high probability $V^{(R)}$ will never go below
$R/2$. This is classical, but without a precise reference at hand, we recall the underlying arguments.



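Lemma 9 We have

$$\mathbb{P}\left[\exists\, n \in \mathbb{N} : V_n^{(R)} < R/2\right] \le \frac{\exp(-\gamma R/(2A^2))}{1 - \exp(-\gamma^2/(2A^2))}$$

and furthermore, a.s.,

$$\liminf_{n \to \infty} \frac{V_n^{(R)}}{n} \ge \gamma$$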

Proof 

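The first estimate is an immediate consequence of the Hoeffding-Azuma inequality, which, applied
to the bounded difference martingale $(-N_n^{(R)})_{n \in \mathbb{N}}$ starting from 0, asserts that for any $t \in \mathbb{R}_+$,

$$\forall\, n \in \mathbb{N}^*, \qquad \mathbb{P}\left[-N_n^{(R)} \ge t\right] \le \exp\left(-t^2/(2nA^2)\right)$$

In particular, since for any $n \in \mathbb{N}$, we have

$$V_n^{(R)} \ge R + n\gamma + N_n^{(R)} \tag{17}$$

we get

$$\begin{aligned}
\mathbb{P}\left[V_n^{(R)} < R/2\right] &\le \mathbb{P}\left[-N_n^{(R)} > R/2 + n\gamma\right] \\
&\le \exp\left(-\frac{R^2}{8nA^2} - \frac{R\gamma}{2A^2} - \frac{n\gamma^2}{2A^2}\right) \\
&\le \exp\left(-\frac{R\gamma}{2A^2}\right) \exp\left(-\frac{n\gamma^2}{2A^2}\right)
\end{aligned}$$

and the first announced bound follows by summation over $n \in \mathbb{N}^*$.

The second bound is also due to the fact that the increments of the martingale $N^{(R)}$ are
bounded, which implies the validity of the iterated logarithm law (see for instance Stout [10]): a.s.,

$$\limsup_{n \to \infty} \frac{\left|N_n^{(R)}\right|}{\sqrt{2n \ln(\ln(n))}} \le A$$

thus (17) enables us to conclude.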

Lemma 9 will be applied with $V^{(R)}$ the logarithm of isoperimetric values, or rather with a sequence
of the kind $(\ln(I_{2n}))_{n \in \mathbb{N}}$.

More precisely, consider the neighborhood $\mathcal{N}$ obtained in Proposition 8. There exists a small
constant $\epsilon > 0$ such that $\mathcal{N}$ contains $\{(x,y) \in \mathcal{D} : 0 \le y < \epsilon\}$ and so taking into account (1),
there exists $R_1 > 1$ such that $\{\Delta : \ln(I(\Delta)) \ge R_1\} \subset \mathcal{N}$ (again we are slightly abusing notation
here, identifying triangles with the characterizing points of their normalized forms, this should
not lead to confusion). Let $T$ be a finite stopping time for the triangle Markov chain $(\Delta(n))_{n \in \mathbb{N}}$.
Assume that $R := \ln(I(\Delta(T)))$ satisfies $R \ge 2R_1$. Define a stopping time $\tau$ for the shifted chain
$(\Delta(T + 2n))_{n \in \mathbb{N}}$ by

$$\tau := \inf\{n \in \mathbb{N} : \ln(I(\Delta(T + 2n))) < R_1\}$$

which is infinite if the set on the r.h.s. is empty. Let $\gamma > 0$ be the constant appearing in Proposition 8.
We construct a stochastic chain $V^{(R)}$ in the following way:

$$\forall\, n \in \mathbb{N}, \qquad V_n^{(R)} := \begin{cases} \ln(I(\Delta(T + 2n))), & \text{if } n \le \tau \\ \ln(I(\Delta(T + 2\tau))) + \gamma(n - \tau), & \text{otherwise} \end{cases}$$



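Let us check that the assumptions for Lemma 9 are satisfied. Following the traditional Doob-Meyer
semi-martingale decomposition (see for instance Dellacherie and Meyer [4]), we define

$$\forall\, n \in \mathbb{N}, \qquad N_n^{(R)} := \sum_{m \in [\![1,n]\!]} \left( V_m^{(R)} - \mathbb{E}\left[V_m^{(R)} \mid \mathcal{F}_{m-1}\right] \right)$$

where for any $n \in \mathbb{N}$, $\mathcal{F}_n$ is the $\sigma$-algebra generated by the trajectory-valued variable $(\Delta(m \wedge (T + n)))_{m \in \mathbb{N}}$.
Using classical stopping time notation, this is the $\sigma$-algebra $\mathcal{T}_{T+n}$, where the filtration
$(\mathcal{T}_m)_{m \in \mathbb{N}}$ was introduced in Lemma 5. After conditioning on $\mathcal{F}_0$ and taking advantage of the strong
Markov property, we can apply Lemma 6 to see that (15) is satisfied with a constant $A$ depending only on $a$ and $b$ (we even
have $N_{n+1}^{(R)} - N_n^{(R)} = 0$ for $n \ge \tau$). Furthermore, we have for any $n \in \mathbb{N}$,

$$\begin{aligned}
V_{n+1}^{(R)} - V_n^{(R)} &= \mathbb{E}\left[\ln(I_{T+2(n+1)}/I_{T+2n}) \mid \Delta(T + 2n)\right] \mathbb{1}_{n < \tau} + \gamma\, \mathbb{1}_{n \ge \tau} + N_{n+1}^{(R)} - N_n^{(R)} \\
&\ge \gamma + N_{n+1}^{(R)} - N_n^{(R)}
\end{aligned}$$

where the last inequality comes from Proposition 8. Then Lemma 9 implies that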

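Proposition 10 Let $\mathcal{N}' := \{\Delta : \ln(I(\Delta)) \ge R_1\}$. There exists a large enough constant $R_2 \ge 2R_1$
such that for any finite stopping time $T$ for the triangle Markov chain $(\Delta(n))_{n \in \mathbb{N}}$ satisfying
$\ln(I(\Delta(T))) \ge R_2$, we have

$$\mathbb{P}\left[\exists\, n \in \mathbb{N} : \Delta(T + n) \notin \mathcal{N}' \mid \mathcal{T}_T\right] \le 1/2$$

Furthermore on the event $\{\forall\, n \in \mathbb{N} : \Delta(T + n) \in \mathcal{N}'\}$, we have a.s.

$$\liminf_{n \to \infty} \frac{\ln(I_{T+n})}{n} \ge \gamma/2$$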

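Indeed, Lemma 9 shows that we can find $R_2 \ge 2R_1$ such that

$$\mathbb{P}[\tau < \infty \mid \mathcal{T}_T] = \mathbb{P}\left[\exists\, n \in \mathbb{N} : \Delta(T + 2n) \notin \mathcal{N}' \mid \mathcal{T}_T\right] \le 1/2$$

On the event $\{\forall\, n \in \mathbb{N} : \Delta(T + 2n) \in \mathcal{N}'\}$, we have a.s.

$$\liminf_{n \to \infty} \frac{\ln(I_{T+2n})}{n} \ge \gamma$$

Lemma 6 permits extending these results to the statement of Proposition 10 (up to replacement
of $R_2$ by $bR_2/a$).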

Now the proof of Theorem 1 is clear. By iteration, introduce two sequences $(S_n)_{n \in \mathbb{N}}$ and $(T_n)_{n \in \mathbb{N}}$
of stopping times for the triangle Markov chain: start with $S_0 = 0$ and for any $n \in \mathbb{N}$, if $S_n$ has
been defined, take

$$T_n := \inf\{m \ge S_n : \ln(I(\Delta(m))) \ge R_2\}$$
$$S_{n+1} := \inf\{m \ge T_n : \ln(I(\Delta(m))) < R_1\}$$

Of course, if for some $n \in \mathbb{N}$, $S_n = \infty$ then for any $m \ge n$, $S_m = T_m = \infty$. Conversely, via
Proposition 4, we see that if $S_n < \infty$, then a.s., $T_n < +\infty$, so the events $\{S_n < \infty\}$ and $\{T_n < \infty\}$
are the same, up to a negligible set. For $n \in \mathbb{N}$, let us define the event

$$E_n := \{S_n < \infty \text{ and } S_{n+1} = \infty\} = \{T_n < \infty \text{ and } \forall\, m \in \mathbb{N},\ \Delta(T_n + m) \in \mathcal{N}'\}$$

Up to conditioning on $\{S_n < \infty\}$, Proposition 10 shows that

$$\mathbb{P}[S_{n+1} = \infty \mid S_n < \infty] = \mathbb{P}[E_n \mid S_n < \infty] \ge 1/2$$

thus it follows easily that $\mathbb{P}[\cup_{n \in \mathbb{N}} E_n] = 1$. Proposition 10 also shows that on all the $E_n$, the sequence
$(I_m^{-1})_{m \in \mathbb{N}}$ converges exponentially fast to zero with rate at least $\gamma/2$. Now the bound (1) implies the
validity of Theorem 1 with $\chi = \gamma/2$.

Remark 11 Let $\gamma_2 := F(1/2) = \min_{x \in [0,1/2]} F(x)$. A closer look at the proof of Proposition 8
shows that for any $\gamma < \gamma_2$, we can find a neighborhood $\mathcal{N}$ of the set of flat triangles such that the
lower bound of Proposition 8 is satisfied. By the above arguments, it follows that Theorem 1 also
holds with $\chi = \gamma_2/2$, so we win a factor 1/2.
But one can go further. For $N \in \mathbb{N} \setminus \{0, 1\}$ and $x \in [0, 1/2]$, consider

$$\begin{aligned}
F_N(x) &:= \frac{1}{6^N} \sum_{(i_1,...,i_N) \in [\![1,6]\!]^N} \ln\left( G(i_N, z_{i_{N-1}} \circ \cdots \circ z_{i_1}(x)) \cdots G(i_2, z_{i_1}(x))\, G(i_1, x) \right) \\
&= \mathbb{E}_x\left[ \sum_{n \in [\![0,N-1]\!]} \ln(G(\iota_{n+1}, Z_n)) \right]
\end{aligned}$$

where $(\iota_n)_{n \in \mathbb{N}^*}$ is a sequence of independent random variables uniformly distributed on $[\![1,6]\!]$ and
$(Z_n)_{n \in \mathbb{N}}$ is the Markov chain starting from $x$ ($Z_0 := x$) constructed from $(\iota_n)_{n \in \mathbb{N}^*}$ through the
relations

$$\forall\, n \in \mathbb{N}, \qquad Z_{n+1} := z_{\iota_{n+1}}(Z_n) \tag{18}$$

Then define

$$\gamma_N := \min_{x \in [0,1/2]} F_N(x)$$

An easy extension of the previous proof shows that Theorem 1 holds with $\chi = \gamma_N/N$ and consequently
with $\chi = \lim_{N \to \infty} \gamma_N/N$. The quantity $\gamma_N/N$ converges due to the weak convergence of
the Markov chain $(Z_n)_{n \in \mathbb{N}}$, uniformly in its initial distribution, as we will show in the next section.
Indeed, if $\mu$ is the attracting invariant probability associated to $(Z_n)_{n \in \mathbb{N}}$, we will see that for any
$x \in [0,1/2]$,

$$\lim_{n \to \infty} \mathbb{E}_x[\ln(G(\iota_{n+1}, Z_n))] = L$$

with

$$L := \frac{1}{6} \sum_{i \in [\![1,6]\!]} \int \ln(G(i,x))\, \mu(dx) \tag{19}$$

It follows from Cesàro's lemma that

$$\lim_{N \to \infty} \frac{F_N(x)}{N} = \lim_{N \to \infty} \frac{1}{N} \sum_{n \in [\![0,N-1]\!]} \mathbb{E}_x[\ln(G(\iota_{n+1}, Z_n))] = L$$

Since this convergence holds uniformly in $x \in [0,1/2]$, we get that Theorem 1 is satisfied with
$\chi = L$. In section 7, we will numerically evaluate that $L \approx 0.07$.

□
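
A rough Monte Carlo estimate of $L$ can be obtained by running the flat chain (18) and averaging $\ln(G(\iota_{n+1}, Z_n))$ along the trajectory, relying on the ergodicity established in section 5. The sketch below is not the numerical experiment of section 7: the closed forms used for $G$ and $z_i$ are our own derivation from the flat geometry and are assumptions, and the function name, seed, starting point and number of steps are arbitrary.

```python
import math
import random

def G(i, x):
    s = math.sqrt
    return (s(2 / 3) * (1 + x), (2 - x) / s(6), s(3 / 2) * (1 - x),
            s(2 / 3) * (2 - x), s(2 / 3) * (2 - x), s(3 / 2))[i - 1]

def z(i, x):
    return (3 * x / (2 * (1 + x)),
            min(3 * x, 2 * (1 - 2 * x)) / (2 - x),
            min(1 + x, 2 * (1 - 2 * x)) / (3 * (1 - x)),
            (1 + x) / (2 * (2 - x)),
            (1 - 2 * x) / (2 * (2 - x)),
            (1 - 2 * x) / 3)[i - 1]

def estimate_L(n_steps, rng, x0=0.25):
    """Time average of ln G(iota_{n+1}, Z_n) along one trajectory of Z."""
    x, total = x0, 0.0
    for _ in range(n_steps):
        i = rng.randint(1, 6)          # fair die
        total += math.log(G(i, x))
        x = z(i, x)                    # one step of the flat chain
    return total / n_steps

L_hat = estimate_L(200_000, random.Random(0))
```

The estimate should be compared with the value of about 0.07 reported above; a single run only gives two or three significant digits.
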
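5 Ergodicity of the limit flat Markov chain

This section studies the limit flat Markov chain $Z := (Z_n)_{n \in \mathbb{N}}$. First we will see that it admits
a unique invariant probability $\mu$ and that it converges exponentially fast to $\mu$ in the Wasserstein
distance. Next we will show that $\mu$ is continuous and that its support is the whole state space
$[0,1/2]$.

We begin by describing the kernel of $Z$ given in (7) in the language of iterated random functions.

Lemma 12 With the notation of the previous sections, we have for all $x \in [0, 1/2]$,

$$\begin{aligned}
z_1(x) &= \frac{3x}{2(1+x)}, & z_2(x) &= \begin{cases} \dfrac{3x}{2-x}, & \text{if } x \in [0, 2/7] \\[2mm] \dfrac{2(1-2x)}{2-x}, & \text{if } x \in [2/7, 1/2] \end{cases} \\
z_3(x) &= \begin{cases} \dfrac{1+x}{3(1-x)}, & \text{if } x \in [0, 1/5] \\[2mm] \dfrac{2(1-2x)}{3(1-x)}, & \text{if } x \in [1/5, 1/2] \end{cases} & z_4(x) &= \frac{1+x}{2(2-x)} \\
z_5(x) &= \frac{1-2x}{2(2-x)}, & z_6(x) &= \frac{1-2x}{3}
\end{aligned}$$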

Proof 

These are immediate computations, based on the fact that for any flat triangle, the abscissa of the
characteristic point is the ratio of the shortest edge by the longest edge. For instance the lengths
of the edges of the triangle $\Delta_2$ are $L_1/2$, $l_1/3$ and $2l_3/3$ with $L_1 = x$, $l_1 = 1 - \frac{x}{2}$ and $l_3 = \frac{1}{2} - x$,
which leads to the above expression for $z_2(x)$.

■
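
The rule used in this proof (ratio of the shortest edge by the longest edge) is easy to implement directly, which gives an independent check of closed forms for the six maps. In the sketch below, `z_closed` encodes the expressions we obtain from that rule; they are our own computation and the function names are hypothetical.

```python
def flat_children(x):
    """Vertices (as abscissas) of the six children of the flat triangle 0, x, 1."""
    a, b, c = 0.0, x, 1.0
    d, e, f = (a + b) / 2, (b + c) / 2, (a + c) / 2
    g = (a + b + c) / 3
    return [(a, d, g), (d, b, g), (b, e, g), (e, c, g), (c, f, g), (f, a, g)]

def z_geometric(i, x):
    p, q, r = sorted(flat_children(x)[i - 1])
    return min(q - p, r - q) / (r - p)   # shortest edge over longest edge

def z_closed(i, x):
    return (3 * x / (2 * (1 + x)),
            min(3 * x, 2 * (1 - 2 * x)) / (2 - x),        # breakpoint at x = 2/7
            min(1 + x, 2 * (1 - 2 * x)) / (3 * (1 - x)),  # breakpoint at x = 1/5
            (1 + x) / (2 * (2 - x)),
            (1 - 2 * x) / (2 * (2 - x)),
            (1 - 2 * x) / 3)[i - 1]

max_gap = max(abs(z_geometric(i, k / 1000) - z_closed(i, k / 1000))
              for i in range(1, 7) for k in range(0, 501))
```

The two computations agree up to rounding on the whole segment, and every value stays in $[0, 1/2]$.
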

To see that the Markov kernel $M$ of $Z$ is ergodic, in the sense that it admits an invariant and
attracting probability, we apply a result due to Barnsley and Elton [1]: let $S$ be a compact
segment of $\mathbb{R}$ (more generally it can be a complete, separable metric space) on which we are given
$n$ Lipschitz functions $f_i : S \to S$, for $i \in [\![1,n]\!]$. Let $p = (p_i)_{i \in [\![1,n]\!]}$ be a probability on $[\![1,n]\!]$ and
consider the Markov kernel $N$ from $S$ to $S$ given by

$$\forall\, x \in S, \qquad N(x, \cdot) := \sum_{i \in [\![1,n]\!]} p_i\, \delta_{f_i(x)}(\cdot) \tag{20}$$

Then under the assumption that there exists a constant $r < 0$ such that

$$\forall\, x \ne y \in S, \qquad \sum_{i \in [\![1,n]\!]} p_i \ln\left( \frac{|f_i(y) - f_i(x)|}{|y - x|} \right) \le r \tag{21}$$

the kernel $N$ is ergodic: it admits a unique invariant and attracting probability $\mu$, satisfying
$\mu N = \mu$ and for any probability $\nu$ on $S$, $\lim_{n \to \infty} \nu N^n = \mu$ (in the weak topology). Furthermore,
Barnsley and Elton [1] show that there exist $q \in (0, 1]$ and $\rho \in (0, 1)$ such that

$$\forall\, x, y \in S, \qquad \sum_{i \in [\![1,n]\!]} p_i\, |f_i(y) - f_i(x)|^q \le \rho\, |y - x|^q \tag{22}$$

Let us rewrite this bound in a more probabilistic way. Let $(\iota_n)_{n \in \mathbb{N}}$ be a sequence of independent
random variables taking values in $[\![1,n]\!]$ with distribution $(p_i)_{i \in [\![1,n]\!]}$. For any $x \in S$, we denote
by $U^x := (U^x_n)_{n \in \mathbb{N}}$ the stochastic chain constructed as follows: $U^x_0 = x$ and for any $n \in \mathbb{N}$, $U^x_{n+1} =
f_{\iota_{n+1}}(U^x_n)$. This is a Markov chain with transition kernel $N$. This construction enables us to couple
together all the Markov chains $U^x$, for $x \in S$. Then the above bound can be written

$$\forall\, x, y \in S, \qquad \mathbb{E}\left[|U^y_1 - U^x_1|^q\right] \le \rho\, |y - x|^q$$

and admits an immediate extension:

$$\forall\, n \in \mathbb{N},\ \forall\, x, y \in S, \qquad \mathbb{E}\left[|U^y_n - U^x_n|^q\right] \le \rho^n\, |y - x|^q$$
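
The coupling can be visualised on a toy example, which is not taken from the paper: on $S = [0,1]$ take the two maps $f_1(x) = x/2$ and $f_2(x) = x/2 + 1/2$ with equal weights, for which the contraction bound holds with $q = 1$ and $\rho = 1/2$ (with equality). The sketch below drives two chains with the same random indices; all names are hypothetical.

```python
import random

def f(i, x):
    """Toy iterated function system on [0, 1] (hypothetical example)."""
    return x / 2 if i == 1 else x / 2 + 0.5

def coupled_paths(x, y, n, rng):
    """Run U^x and U^y for n steps with the SAME random indices."""
    ux, uy = [x], [y]
    for _ in range(n):
        i = rng.randint(1, 2)
        ux.append(f(i, ux[-1]))
        uy.append(f(i, uy[-1]))
    return ux, uy

ux, uy = coupled_paths(0.25, 0.75, 30, random.Random(0))
gaps = [abs(a - b) for a, b in zip(ux, uy)]
```

Whatever the indices drawn, the gap between the two coupled chains is halved at each step, so the law of the chain forgets its starting point exponentially fast.
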




This leads us to consider the Wasserstein distance $D$ between probability measures on $S$: if $\nu_1$ and
$\nu_2$ are two such measures,

$$D(\nu_1, \nu_2) := \sup_{f \in \mathcal{L}(1)} |\nu_1[f] - \nu_2[f]|$$

where $\mathcal{L}(1)$ is the set of Lipschitz functions on $S$ whose Lipschitz constant is less than or equal to
1. We may now show that

Lemma 13 Under the above assumption (21), we have for any $n \in \mathbb{N}$ and any $x \in S$,

$$D(N^n(x, \cdot), \mu) \le \mathrm{diam}(S)\, \rho^n$$

where $\rho$ and $q$ are as in (22) and $\mathrm{diam}(S)$ is the diameter of $S$. It follows that $U^x$ satisfies the law
of large numbers: for any continuous function $f$ on $S$, we have a.s.,

$$\lim_{N \to \infty} \frac{1}{N + 1} \sum_{n \in [\![0,N]\!]} f(U^x_n) = \mu[f] \tag{23}$$



Proof 

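Let $f \in \mathcal{L}(1)$. We compute that

$$\begin{aligned}
|N^n(x, f) - \mu[f]| &= \left| \int \mu(dy)\, (N^n(x,f) - N^n(y,f)) \right| \\
&\le \sup_{x,y \in S} |N^n(x,f) - N^n(y,f)| \\
&= \sup_{x,y \in S} \left| \mathbb{E}[f(U^x_n) - f(U^y_n)] \right| \\
&\le \sup_{x,y \in S} \mathbb{E}[|U^x_n - U^y_n|] \\
&= \mathrm{diam}(S) \sup_{x,y \in S} \mathbb{E}\left[ \frac{|U^x_n - U^y_n|}{\mathrm{diam}(S)} \right] \\
&\le \mathrm{diam}(S) \sup_{x,y \in S} \mathbb{E}\left[ \left( \frac{|U^x_n - U^y_n|}{\mathrm{diam}(S)} \right)^q \right] \\
&\le \mathrm{diam}(S)^{1-q} \sup_{x,y \in S} \rho^n\, |y - x|^q \\
&\le \mathrm{diam}(S)\, \rho^n
\end{aligned}$$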

The announced bound follows by taking the supremum over all functions $f \in \mathcal{L}(1)$. The law
of large numbers is deduced from a traditional martingale argument based on the existence of a
bounded solution to the Poisson equation. More precisely, for $f \in \mathcal{L}(1)$, we can define

$$\forall\, x \in S, \qquad \varphi(x) := \sum_{n \in \mathbb{N}} \mathbb{E}\left[f(U^x_n) - \mu[f]\right]$$

since the r.h.s. converges exponentially fast and uniformly with respect to $x \in S$. Furthermore we
easily see that $\varphi$ is a Lipschitz function and that it is a solution to the Poisson equation

$$\forall\, x \in S, \qquad \varphi(x) - N(x, \varphi) = f(x) - \mu[f]$$

This enables us to write for any $n \in \mathbb{N}$,

$$f(U^x_0) + f(U^x_1) + \cdots + f(U^x_n) = (n + 1)\mu[f] + \varphi(U^x_0) - \varphi(U^x_{n+1}) + M_{n+1}$$

where $(M_n)_{n \in \mathbb{N}}$ is a martingale whose increments are bounded. The law of large numbers for
functions $f$ belonging to $\mathcal{L}(1)$ then follows from the well-known fact that $M_n/n$ converges a.s. to
zero. It is also true for all Lipschitz functions $f$. Next, given a continuous function $f$ on $S$ and
$m \in \mathbb{N}^*$, by usual approximations, it is possible to find a Lipschitz function $f_m$ on $S$ such that
$\|f - f_m\|_{S,\infty} \le 1/m$, where $\|\cdot\|_{S,\infty}$ is the uniform norm on $S$. It follows that on a measurable set
$\Omega_m$ of probability 1,

$$\mu[f] - \frac{2}{m} \le \liminf_{N \to \infty} \frac{1}{N + 1} \sum_{n \in [\![0,N]\!]} f(U^x_n) \le \limsup_{N \to \infty} \frac{1}{N + 1} \sum_{n \in [\![0,N]\!]} f(U^x_n) \le \mu[f] + \frac{2}{m}$$

Thus on the set $\cap_{m \in \mathbb{N}^*} \Omega_m$ of full probability, (23) is true.



Let us discuss condition (21). Note that since the functions $f_i$, for $i \in [\![1,n]\!]$, are Lipschitz, they
are absolutely continuous. Let us write $f_i'$ for their respective weak derivatives. By letting $y$ and
$x$ become close in criterion (21), we get that almost everywhere in $x \in S$,

$$\sum_{i \in [\![1,n]\!]} p_i \ln(|f_i'(x)|) \le r \tag{24}$$



Condition (24) is not sufficient to ensure that the kernel $N$ is ergodic. Consider the following example with $S = [0,1]$, $n = 2$ and the functions $f_1$ and $f_2$ defined by

$$\forall\, x \in [0,1], \qquad f_i(x) := \begin{cases} \min(2x, 1), & \text{if } i = 1 \\ \max(0, -1+2x), & \text{if } i = 2 \end{cases}$$

In this case (24) is even satisfied with $r = -\infty$ and the set of invariant probability measures is $\{a\delta_0 + (1-a)\delta_1 : a \in [0,1]\}$, so none of them can be attractive (but the law of a corresponding Markov chain converges exponentially fast to one of the invariant probability measures).
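The absorption behind this example can be observed directly. The following Python sketch iterates the two maps from several starting points; the names `f1`, `f2`, `run_chain` are ours, and a uniform weight $p$ on $\{1,2\}$ is an assumption (the text does not specify $p$). Every trajectory ends exactly at 0 or at 1, which is why two distinct invariant Dirac masses coexist.

```python
import random

def f1(x):
    # first map of the example: min(2x, 1)
    return min(2.0 * x, 1.0)

def f2(x):
    # second map of the example: max(0, -1 + 2x)
    return max(0.0, -1.0 + 2.0 * x)

def run_chain(x0, steps, rng):
    """Iterate X_{k+1} = f_i(X_k), with i uniform on {1, 2} (assumed weights)."""
    x = x0
    for _ in range(steps):
        x = f1(x) if rng.random() < 0.5 else f2(x)
    return x

rng = random.Random(0)  # fixed seed, only for reproducibility
finals = [run_chain(x0, 200, rng)
          for x0 in (0.1, 0.3, 0.5, 0.7, 0.9) for _ in range(20)]
absorbed_at_0 = sum(1 for x in finals if x == 0.0)
absorbed_at_1 = sum(1 for x in finals if x == 1.0)
print(absorbed_at_0, absorbed_at_1)
```

Both 0 and 1 are fixed by $f_1$ and $f_2$, so once a trajectory reaches one of them it stays there.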

Nevertheless, under some circumstances, the necessary condition (24) is also sufficient. This is the case if for all the functions $|f_i'|$, with $i \in \llbracket 1,n\rrbracket$, there exist $a_i, b_i \in \mathbb{R}$ such that almost everywhere (a.e.) in $x \in S$,

$$|f_i'(x)| = (a_i x + b_i)^{-2} \qquad (25)$$



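(in particular $-b_i/a_i$ cannot belong to $S$, otherwise $f_i'$ would not be integrable over this interval). Indeed in this situation we can write that for any $x < y \in S$,

$$\begin{aligned}
\frac{|f_i(y)-f_i(x)|}{|y-x|} &= \frac{1}{|y-x|}\left|\int_x^y f_i'(z)\,dz\right| \\
&\le \frac{1}{|y-x|}\int_x^y |f_i'(z)|\,dz \\
&= \frac{1}{|y-x|}\int_x^y \frac{1}{(a_i z+b_i)^2}\,dz \\
&= \frac{1}{a_i|y-x|}\left|\frac{1}{a_i x+b_i}-\frac{1}{a_i y+b_i}\right| \\
&= \frac{1}{a_i}\,\frac{a_i(y-x)}{|y-x|\,(a_i y+b_i)(a_i x+b_i)} \\
&= \frac{1}{|a_i y+b_i|\,|a_i x+b_i|} \\
&= \sqrt{|f_i'(y)|\,|f_i'(x)|}
\end{aligned}$$

where the last equality has to be understood a.e. It follows that, at least for a.e. $x, y \in S$,

$$\ln\left(\frac{|f_i(y)-f_i(x)|}{|y-x|}\right) \le \frac{\ln(|f_i'(y)|)+\ln(|f_i'(x)|)}{2} \qquad (26)$$

and consequently

$$\sum_{i\in\llbracket 1,n\rrbracket} p_i \ln\left(\frac{|f_i(y)-f_i(x)|}{|y-x|}\right) \le \frac12\sum_{i\in\llbracket 1,n\rrbracket} p_i\ln(|f_i'(y)|) + \frac12\sum_{i\in\llbracket 1,n\rrbracket} p_i \ln(|f_i'(x)|)$$

a formula which enables passing from (24) to (21), since the latter only has to be checked for a.e. $x, y \in S$.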
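For a map whose derivative has exactly the form (25), the computation above says that the slope between two points is the geometric mean of the derivatives at those points. A quick numerical check in Python, using as an illustrative choice the map $x \mapsto 3x/(2(1+x))$ (this is the function `z1` of the Scilab listings in the appendix; the helper names are ours):

```python
import math

def f(x):
    # illustrative Moebius map, equal to z_1 of the appendix
    return 1.5 * x / (1.0 + x)

def df(x):
    # its derivative 3 / (2 (1 + x)^2), which is of the form (a x + b)^(-2)
    return 1.5 / (1.0 + x) ** 2

def slope(x, y):
    # |f(y) - f(x)| / |y - x|
    return abs(f(y) - f(x)) / abs(y - x)

pairs = [(0.0, 0.5), (0.1, 0.2), (0.25, 0.4), (0.05, 0.45)]
for x, y in pairs:
    lhs = math.log(slope(x, y))
    rhs = 0.5 * (math.log(df(x)) + math.log(df(y)))
    print(f"x={x}, y={y}: ln(slope)={lhs:.12f}, mean of ln|f'|={rhs:.12f}")
```

For this map the two printed columns agree up to rounding, so the bound (26) is attained with equality.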

It is time now to come back to the flat triangle Markov chain. Consider the setting where $N = M$, i.e. $S = [0, 1/2]$, $n = 6$, $f_i = z_i$ for $i \in \llbracket 1,6\rrbracket$ and $p$ the uniform distribution on $\llbracket 1,6\rrbracket$. Now condition (25) is satisfied. Since $z_2'(2/7-) = -z_2'(2/7+)$ and $z_3'(1/5-) = -z_3'(1/5+)$, we see that $|z_i'|(x)$ can be defined everywhere (a convention that we will adopt from now on). Indeed, we compute that for any $x \in [0, 1/2]$,

$$\begin{aligned}
|z_1'|(x) &= \frac{3}{2(1+x)^2}, & |z_2'|(x) &= \frac{6}{(2-x)^2}, \\
|z_3'|(x) &= \frac{2}{3(1-x)^2}, & |z_4'|(x) &= \frac{3}{2(2-x)^2}, \\
|z_5'|(x) &= \frac{3}{2(2-x)^2}, & |z_6'|(x) &= \frac{2}{3}
\end{aligned}$$
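These closed forms can be cross-checked against finite differences. The Python sketch below transcribes the six maps from the Scilab listing of the appendix (the function names `z`, `dz`, `finite_difference` and `mean_log_dz` are ours) and also evaluates $\frac16\sum_i \ln |z_i'|(x)$ at both endpoints of $[0,1/2]$.

```python
import math

def z(i, x):
    """The maps z_1, ..., z_6 on [0, 1/2], as in the appendix listings."""
    if i == 1:
        return 1.5 * x / (1 + x)
    if i == 2:
        return 3 * x / (2 - x) if x < 2 / 7 else (2 - 4 * x) / (2 - x)
    if i == 3:
        return (1 + x) / (3 - 3 * x) if x < 1 / 5 else (2 - 4 * x) / (3 - 3 * x)
    if i == 4:
        return (1 + x) / (4 - 2 * x)
    if i == 5:
        return (1 - 2 * x) / (4 - 2 * x)
    return (1 - 2 * x) / 3

def dz(i, x):
    """Absolute derivatives |z_i'|(x), closed forms."""
    if i == 1:
        return 1.5 / (1 + x) ** 2
    if i == 2:
        return 6 / (2 - x) ** 2
    if i == 3:
        return 2 / (3 * (1 - x) ** 2)
    if i in (4, 5):
        return 1.5 / (2 - x) ** 2
    return 2 / 3

def finite_difference(i, x, h=1e-6):
    # central difference approximation of |z_i'(x)|
    return abs(z(i, x + h) - z(i, x - h)) / (2 * h)

def mean_log_dz(x):
    # (1/6) * sum over i of ln |z_i'|(x)
    return sum(math.log(dz(i, x)) for i in range(1, 7)) / 6

# sample points chosen away from the kinks 1/5 and 2/7
worst = max(abs(finite_difference(i, x) - dz(i, x))
            for i in range(1, 7) for x in (0.05, 0.15, 0.25, 0.35, 0.45))
print(worst, mean_log_dz(0.0), mean_log_dz(0.5))
```

The averaged logarithm comes out negative at 0 and positive at 1/2, so a single step of the kernel cannot give a uniformly negative bound on $[0,1/2]$.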



Unfortunately (24) is not true and surprisingly it is a computation we have already encountered: comparing with (11), we see that

$$\forall\, i \in \llbracket 1,6\rrbracket,\ \forall\, x \in [0,1/2], \qquad |z_i'|(x) = \frac{1}{G^2(i,x)}$$

thus by the observation before Proposition 8 we know that $\sum_{i\in\llbracket 1,6\rrbracket} \ln(|z_i'|(x))$ is positive for $x$ near 1/2. As in section 3, we get around this difficulty by iterating the kernel $M$ one more time (this trick was also used by Barnsley and Elton in one example of their paper [1]). So we consider $N = M^2$, namely $S = [0,1/2]$, $n = 36$, $f_{i,j} = z_i \circ z_j$ for $(i,j) \in \llbracket 1,6\rrbracket^2$ and $p$ the uniform distribution on $\llbracket 1,6\rrbracket^2$. The advantage is that we have for any $i,j \in \llbracket 1,6\rrbracket$ and any $x \in [0,1/2]$,

$$|f_{i,j}'|(x) = |z_i'|(z_j(x))\,|z_j'|(x) = \big(G(i,z_j(x))\,G(j,x)\big)^{-2}$$

Thus

$$\forall\, x \in [0,1/2], \qquad \frac{1}{36}\sum_{i,j\in\llbracket 1,6\rrbracket} \ln(|f_{i,j}'|(x)) = -2F(x)$$

and in particular the left hand side is negative due to Proposition 8 (it is even increasing as a function of $x \in [0,1/2]$ according to the observation made at the end of section 3). But (25) is no longer satisfied by the functions $f_{i,j}$. To avoid this problem, we come back directly to the bound (26): for $i,j \in \llbracket 1,6\rrbracket$ and $y > x \in [0,1/2]$, we write

$$\begin{aligned}
\ln\left(\frac{|f_{i,j}(y)-f_{i,j}(x)|}{|y-x|}\right) &= \ln\left(\frac{|z_i(z_j(y))-z_i(z_j(x))|}{|z_j(y)-z_j(x)|}\right) + \ln\left(\frac{|z_j(y)-z_j(x)|}{|y-x|}\right) \\
&\le \frac{\ln(|z_i'(z_j(y))|)+\ln(|z_i'(z_j(x))|)}{2} + \frac{\ln(|z_j'(y)|)+\ln(|z_j'(x)|)}{2} \\
&= \frac{\ln(|f_{i,j}'(y)|)+\ln(|f_{i,j}'(x)|)}{2}
\end{aligned}$$

In this situation we can also come back from (25) to (24) and the results of Barnsley and Elton [1] ensure that the iterated Markov kernel $M^2$ is ergodic. To come back from $M^2$ to $M$ is not difficult:

Proposition 14 The kernel M is ergodic and the Markov chain Z satisfies the strong law of large 
numbers. 

Proof 

Let $\mu$ be the attracting and invariant probability for $M^2$. Then we have $(\mu M)M^2 = (\mu M^2)M = \mu M$, so $\mu M$ is invariant for $M^2$ and by uniqueness it follows that $\mu M = \mu$. Next for any probability measure $\nu$ on $[0,1/2]$, the (weak) limit set of $(\nu M^n)_{n\in\mathbb{N}}$ is included in $\{\mu, \mu M\} = \{\mu\}$, so $\mu$ is also attracting for $M$ and the uniqueness of $\mu$ as the invariant probability of $M$ follows. Finally the strong law of large numbers for $Z$ can be deduced from that of the two Markov chains $(Z_{2n})_{n\in\mathbb{N}}$ and $(Z_{1+2n})_{n\in\mathbb{N}}$.

■ 
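The ergodicity of $M^2$ used in this proof rests on the negativity of the averaged logarithmic derivative of the 36 compositions $z_i \circ z_j$. This can be inspected numerically. The sketch below is in Python, with the maps transcribed from the Scilab listings of the appendix; the helper names and the grid of 501 points are our choices, and the weights are taken uniform ($1/36$).

```python
import math

def z(i, x):
    """The maps z_1, ..., z_6 on [0, 1/2], as in the appendix listings."""
    if i == 1:
        return 1.5 * x / (1 + x)
    if i == 2:
        return 3 * x / (2 - x) if x < 2 / 7 else (2 - 4 * x) / (2 - x)
    if i == 3:
        return (1 + x) / (3 - 3 * x) if x < 1 / 5 else (2 - 4 * x) / (3 - 3 * x)
    if i == 4:
        return (1 + x) / (4 - 2 * x)
    if i == 5:
        return (1 - 2 * x) / (4 - 2 * x)
    return (1 - 2 * x) / 3

def dz(i, x):
    """Absolute derivatives |z_i'|(x)."""
    if i == 1:
        return 1.5 / (1 + x) ** 2
    if i == 2:
        return 6 / (2 - x) ** 2
    if i == 3:
        return 2 / (3 * (1 - x) ** 2)
    if i in (4, 5):
        return 1.5 / (2 - x) ** 2
    return 2 / 3

def mean_log_derivative(x):
    # (1/36) * sum over (i, j) of ln |(z_i o z_j)'|(x), by the chain rule
    total = 0.0
    for i in range(1, 7):
        for j in range(1, 7):
            total += math.log(dz(i, z(j, x)) * dz(j, x))
    return total / 36

grid = [k / 1000 for k in range(501)]
values = [mean_log_derivative(x) for x in grid]
print(max(values), mean_log_derivative(0.5))
```

On this grid the average stays strictly negative, in contrast with the one-step average, which is positive near 1/2.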

There was a cruder way to deduce the ergodicity of M: 

Remark 15 Diaconis and Freedman [5] consider a simpler criterion for ergodicity of a random function Markov kernel (20): for $i \in \llbracket 1,n\rrbracket$, let $K_i := \sup_{x\neq y} |f_i(y) - f_i(x)|/|y-x|$ be the Lipschitz constant of $f_i$ and assume that there exists a constant $r < 0$ such that

$$\sum_{i\in\llbracket 1,n\rrbracket} p_i \ln(K_i) \le r \qquad (27)$$

then the kernel $N$ is ergodic and Diaconis and Freedman [5] show that the convergence is exponentially fast in the Prokhorov distance (but for us the Wasserstein distance is more convenient because in the end we would like to couple the two Markov chains $(X_n,Y_n)_{n\in\mathbb{N}}$ and $(Z_n)_{n\in\mathbb{N}}$). Of course condition (27) implies (21). Since for $i \in \llbracket 1,n\rrbracket$, $K_i$ is the essential supremum of $|f_i'(x)|$, (27) corresponds to the exchange of essential supremum and sum in (21). Let us now come back to our flat triangle Markov chain. From the previous considerations, (27) cannot be satisfied with $N = M$. It does not work either with $N = M^2$, so this is an example where the criterion (21) is fulfilled while (27) is not. But condition (27) is satisfied with $N = M^3$, namely $S = [0,1/2]$, $n = 216$, $f_{i,j,k} = z_i \circ z_j \circ z_k$ for $(i,j,k) \in \llbracket 1,6\rrbracket^3$ and $p$ the uniform distribution on $\llbracket 1,6\rrbracket^3$. For the details of the underlying numerical computations, we refer to section 7.

□

To finish we prove two properties of $\mu$ which will be needed in the next section. The first one is

Lemma 16 The probability $\mu$ contains no atom, in particular $\mu(\{0\}) = 0$.

Proof

The proof needs a few steps and notation. Let us define

$$\mu^* := \sup\{\mu(\{x\}) : x \in [0,1/2]\}, \qquad S^* := \{x \in [0,1/2] : \mu(\{x\}) = \mu^*\}$$

and for any $x \in [0,1/2]$,

$$\widetilde{S}(x) := \{(i,y) \in \llbracket 1,6\rrbracket \times [0,1/2] : z_i(y) = x\}, \qquad S(x) := \{y \in [0,1/2] : \exists\, i \in \llbracket 1,6\rrbracket \text{ with } z_i(y) = x\}$$

The first step is:

• We have $\mu(\{0\}) = \mu(\{1/2\}) \le \mu^*/2$.

By invariance of $\mu$ we can write that

$$\mu(\{0\}) = \mu(M[\mathbb{1}_{\{0\}}]) = \frac16 \sum_{(i,y)\in\widetilde{S}(0)} \mu(\{y\}) = \frac13\mu(\{0\}) + \frac23\mu(\{1/2\})$$

and this relation implies that $\mu(\{0\}) = \mu(\{1/2\})$. Next consider the point 1/2. We get

$$\mu(\{1/2\}) = \frac16 \sum_{(i,y)\in\widetilde{S}(1/2)} \mu(\{y\}) = \frac13\mu(\{1/2\}) + \frac16\mu(\{1/5\}) + \frac16\mu(\{2/7\}) \le \frac13\mu(\{1/2\}) + \frac13\mu^*$$

so it follows that $\mu(\{1/2\}) \le \mu^*/2$.
The next step is: 

• For any $x \in S^*$, we have $S(x) \subset S^* \cup \{0\}$.

Looking at the graphs of the functions $z_i$, for $i \in \llbracket 1,6\rrbracket$ (see Figure 1 in section 7), we get that

$$\forall\, x \in [0,1/2], \qquad \mathrm{card}(\widetilde{S}(x)) = \begin{cases} 4, & \text{if } x = 1/2 \\ 7, & \text{if } x = 1/4 \text{ or } x = 1/3 \\ 6, & \text{otherwise} \end{cases}$$

So consider $x \in S^* \setminus \{1/4, 1/3, 1/2\}$; writing

$$\mu^* = \mu(\{x\}) = \frac16 \sum_{(i,y)\in\widetilde{S}(x)} \mu(\{y\})$$

it appears that equality is possible only if $\mu(\{y\}) = \mu^*$ for all $y \in S(x)$, namely $S(x) \subset S^*$.
We study now the three particular cases of 1/4, 1/3 and 1/2.

- For 1/2: as seen in the first step, $1/2 \in S^*$ implies that $\mu^* = 0$, so $S^* = [0,1/2]$ and the inclusion $S(1/2) \subset S^*$ is trivial.

- For 1/4: there exist five distinct points $y_1, y_2, y_3, y_4, y_5 \in (0,1/2)$ such that we have

$$\widetilde{S}(1/4) = \{(4,0),(5,0),(1,y_1),(2,y_2),(2,y_3),(3,y_4),(6,y_5)\}$$

so by invariance of $\mu$ we get

$$\mu(\{1/4\}) = \frac16\Big(2\mu(\{0\}) + \sum_{i\in\llbracket 1,5\rrbracket} \mu(\{y_i\})\Big) \le \frac16\big(2\mu(\{0\}) + 5\mu^*\big)$$

and since we know that $\mu(\{0\}) \le \mu^*/2$, the equality $\mu(\{1/4\}) = \mu^*$ is possible only if $\mu(\{y_i\}) = \mu^*$ for $i \in \llbracket 1,5\rrbracket$, so we can conclude that $S(1/4) \subset S^* \cup \{0\}$.

- The same argument holds for 1/3 (even if $1/3 \in S(1/3)$).

For the last step, let us denote by $\hat{z}_2$ the restriction of $z_2$ to $[0, 2/7]$. This mapping is one-to-one from $[0, 2/7]$ to $[0, 1/2]$ and we denote its inverse by $\hat{z}_2^{-1}$.

• For $x \in (0, 1/2]$, the set $\{\hat{z}_2^{-n}(x) : n \in \mathbb{N}\}$ is infinite, so $S^*$ is infinite.

The first assertion comes from the fact that for any $x \in (0,1/2]$, $0 < \hat{z}_2^{-1}(x) < x$, so $(\hat{z}_2^{-n}(x))_{n\in\mathbb{N}}$ is indeed a decreasing sequence (converging to 0). By the first step, $S^*$ cannot be reduced to $\{0\}$, so there exists $x \in S^* \setminus \{0\}$. By the previous step, the sequence $(\hat{z}_2^{-n}(x))_{n\in\mathbb{N}}$ is included in $S^*$, since none of its elements can be equal to 0. It follows that $S^*$ is infinite.

Of course the last statement implies that $\mu^* = 0$, because $\mu$ is a probability measure.

■ 
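The cardinalities of the sets $\widetilde{S}(x)$ used in the second step can be recomputed exactly. The Python sketch below is an illustration with our own encoding: each $z_i$ is described by its branches $y \mapsto (ay+b)/(cy+d)$, taken from the Scilab listings of the appendix, and the equation $z_i(y) = x$ is solved in rational arithmetic.

```python
from fractions import Fraction as Fr

# Each branch is (i, lo, hi, a, b, c, d) with z_i(y) = (a*y + b) / (c*y + d) on [lo, hi].
BRANCHES = [
    (1, Fr(0), Fr(1, 2), 3, 0, 2, 2),
    (2, Fr(0), Fr(2, 7), 3, 0, -1, 2),
    (2, Fr(2, 7), Fr(1, 2), -4, 2, -1, 2),
    (3, Fr(0), Fr(1, 5), 1, 1, -3, 3),
    (3, Fr(1, 5), Fr(1, 2), -4, 2, -3, 3),
    (4, Fr(0), Fr(1, 2), 1, 1, -2, 4),
    (5, Fr(0), Fr(1, 2), -2, 1, -2, 4),
    (6, Fr(0), Fr(1, 2), -2, 1, 0, 3),
]

def preimages(x):
    """Set of pairs (i, y) with z_i(y) = x and y in [0, 1/2]."""
    x = Fr(x)
    found = set()
    for i, lo, hi, a, b, c, d in BRANCHES:
        denom = a - c * x          # never zero for x in [0, 1/2]
        y = (d * x - b) / denom    # unique solution of a*y + b = x*(c*y + d)
        if lo <= y <= hi:
            found.add((i, y))      # a set, so a kink shared by two branches counts once
    return found

for x in (Fr(0), Fr(1, 8), Fr(1, 4), Fr(1, 3), Fr(2, 5), Fr(1, 2)):
    print(x, len(preimages(x)))
```

For $x = 0$ the six pairs split into two with $y = 0$ and four with $y = 1/2$, which is the decomposition used in the first step.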

If all of the functions $z_i$, for $i \in \llbracket 1,6\rrbracket$, were strictly monotone, the fact that $\mu(\{0\}) = 0$ could have been deduced more directly from the uniqueness of $\mu$ and Theorem 2.10 from Dubins and Freedman [7]. The second piece of information we need about $\mu$ is a direct consequence of a result of the latter paper.

Lemma 17 The support of $\mu$ is the whole segment $[0,1/2]$.

Proof

By Theorem 4.9 of Dubins and Freedman [7], the support of $\mu$ is the whole segment $[0, 1/2]$ if we can cover it by the images of the functions $z_i$ which are strict contractions. But this is the case here, since $z_4$ and $z_5$ are strict contractions and $z_4([0, 1/2]) = [1/4, 1/2]$ and $z_5([0, 1/2]) = [0, 1/4]$.



6 More on the asymptotic behaviour 

Our main goal here is to prove Theorems 2 and 3. The underlying tool is to couple the Markov chains $(X_n,Y_n)_{n\in\mathbb{N}}$ and $(Z_n)_{n\in\mathbb{N}}$ to take advantage of the information we have on the chain $(Z_n)_{n\in\mathbb{N}}$.

A natural coupling between the above chains is based on the construction alluded to in Remark 
[TTl Assume that {Xq,Yq) and Zq are given and let {in)nen* be a sequence of independent random 
variables uniformly distributed on |1, 6] and independent from the previous initial conditions. We 
consider {Zn)neN constructed as in (fTSJl and similarly we iteratively define {Xn,Yn)n£N via 

$$\forall\, n \in \mathbb{N}, \qquad (X_{n+1}, Y_{n+1}) := \big(x_{\iota_{n+1}}(X_n,Y_n),\; y_{\iota_{n+1}}(X_n,Y_n)\big)$$

In these relations, the indices refer to the conventions made in ([3|) and ^. A first simple property 
of this coupling is 

Lemma 18 The random variables $|X_n - Z_n|$ converge in probability to zero as $n$ goes to infinity:

$$\forall\, \epsilon > 0, \qquad \lim_{n\to\infty} \mathbb{P}[|X_n - Z_n| > \epsilon] = 0$$

Proof 

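First we iterate Lemma 7 to show that with $K' := K^2 + 8K/3 > 0$, we have

$$\forall\, i,j \in \llbracket 1,6\rrbracket,\ \forall\, (x,y) \in \mathcal{D}, \qquad |x_i(x_j(x,y),y_j(x,y)) - z_i(z_j(x))| \le K'y$$

Indeed, taking into account that all the functions $z_i$, $i \in \llbracket 1,6\rrbracket$, have a Lipschitz constant less than (or equal to) 8/3, we deduce that for any $i,j \in \llbracket 1,6\rrbracket$ and any $(x,y) \in \mathcal{D}$,

$$\begin{aligned}
|x_i(x_j(x,y),y_j(x,y)) - z_i(z_j(x))| &\le |x_i(x_j(x,y),y_j(x,y)) - z_i(x_j(x,y))| + |z_i(x_j(x,y)) - z_i(z_j(x))| \\
&\le K|y_j(x,y)| + \frac83 |x_j(x,y) - z_j(x)| \\
&\le K^2 y + \frac{8K}{3} y
\end{aligned}$$

Let $q \in (0,1]$ and $\rho \in (0,1)$ as in (22) but relative to the kernel $N = M^2$. Then for any $n \in \mathbb{N}$, we can write

$$\begin{aligned}
\mathbb{E}[|X_{n+2} - Z_{n+2}|^q \mid X_n, Y_n, Z_n]
&= \mathbb{E}\big[|x_{\iota_{n+2}}(x_{\iota_{n+1}}(X_n,Y_n), y_{\iota_{n+1}}(X_n,Y_n)) - z_{\iota_{n+2}}(z_{\iota_{n+1}}(Z_n))|^q \mid X_n, Y_n, Z_n\big] \\
&\le \mathbb{E}\big[|x_{\iota_{n+2}}(x_{\iota_{n+1}}(X_n,Y_n), y_{\iota_{n+1}}(X_n,Y_n)) - z_{\iota_{n+2}}(z_{\iota_{n+1}}(X_n))|^q \mid X_n, Y_n, Z_n\big] \\
&\qquad + \mathbb{E}\big[|z_{\iota_{n+2}}(z_{\iota_{n+1}}(X_n)) - z_{\iota_{n+2}}(z_{\iota_{n+1}}(Z_n))|^q \mid X_n, Y_n, Z_n\big] \\
&\le (K'Y_n)^q + \rho\,|X_n - Z_n|^q
\end{aligned}$$

For $n \in \mathbb{N}$, denote

$$a_n := \mathbb{E}[|X_n - Z_n|^q], \qquad b_n := (K')^q\, \mathbb{E}[Y_n^q]$$

After integration, the above bound leads to

$$\forall\, n \in \mathbb{N}, \qquad a_{n+2} \le \rho\, a_n + b_n$$

We deduce that

$$\forall\, n \in \mathbb{N}, \qquad a_{2n} \le a_0 \rho^n + \sum_{m\in\llbracket 0,n-1\rrbracket} b_{2m}\, \rho^{n-1-m} \qquad (28)$$

where $\lim_{n\to\infty} a_{2n} = 0$ is a consequence of $\lim_{n\to\infty} b_n = 0$. A similar computation shows that this latter condition also implies that $\lim_{n\to\infty} a_{2n+1} = 0$, i.e. at the end we will be ensured of

$$\lim_{n\to\infty} \mathbb{E}[|X_n - Z_n|^q] = 0$$

and thus of the announced convergence in probability.

But we already know that $(Y_n)_{n\in\mathbb{N}}$ converges a.s. to zero and since this sequence is uniformly bounded, we see by the dominated convergence theorem that $\lim_{n\to\infty} b_n = 0$.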

■ 
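The passage from the recursion $a_{n+2} \le \rho a_n + b_n$ to the bound (28) can be illustrated numerically. In the Python sketch below, the contraction factor `rho` and the sequence `b` are hypothetical values chosen only for the illustration (they are not computed from the chain); the recursion is run with equality along even indices and compared with the right-hand side of (28).

```python
import math

def closed_form(a0, b, rho, n):
    """Right-hand side of (28): a0*rho^n + sum_{m=0}^{n-1} b[2m]*rho^(n-1-m)."""
    return a0 * rho ** n + sum(b[2 * m] * rho ** (n - 1 - m) for m in range(n))

def iterate(a0, b, rho, n):
    """Extremal case of a_{k+2} <= rho*a_k + b_k, run with equality; returns a_{2n}."""
    a = a0
    for m in range(n):
        a = rho * a + b[2 * m]
    return a

rho = 0.8                                  # hypothetical contraction factor in (0, 1)
b = [0.5 * 0.9 ** k for k in range(400)]   # hypothetical sequence with b_k -> 0
for n in (1, 5, 50, 200):
    print(n, iterate(1.0, b, rho, n), closed_form(1.0, b, rho, n))
```

Both columns coincide, and they go to zero as $n$ grows, as soon as $b_n \to 0$.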

Now Theorem [2] follows quite easily: 

Proof of Theorem [2] 

For $n \in \mathbb{N}$, denote by $A_n'$ (respectively $A_n''$) the angle between $[(0,0),(X_n,Y_n)]$ and $[(X_n,Y_n),(X_n,0)]$ (resp. $[(X_n,0),(X_n,Y_n)]$ and $[(X_n,Y_n),(1,0)]$), so that $A_n = A_n' + A_n''$. Since the length of $[(X_n,0),(1,0)]$ is larger than 1/2 and $Y_n$ converges a.s. to zero for large $n \in \mathbb{N}$, it is clear that $A_n''$ converges a.s. to $\pi/2$. Furthermore, we have $\tan(A_n') = X_n/Y_n$, so to see that $A_n$ converges in probability toward $\pi$, we must see that $Y_n/X_n$ converges in probability toward 0. Let $\epsilon, \eta > 0$ be given, we have for any $n \in \mathbb{N}$,

$$\begin{aligned}
\mathbb{P}[Y_n/X_n > \epsilon] &= \mathbb{P}[Y_n/X_n > \epsilon,\, X_n > 2\eta] + \mathbb{P}[Y_n/X_n > \epsilon,\, X_n \le 2\eta] \\
&\le \mathbb{P}[Y_n > 2\epsilon\eta] + \mathbb{P}[X_n \le 2\eta] \\
&\le \mathbb{P}[Y_n > 2\epsilon\eta] + \mathbb{P}[|X_n - Z_n| > \eta] + \mathbb{P}[Z_n \le 3\eta] \qquad (29)
\end{aligned}$$

By letting $n$ go to infinity, taking into account that the stationary distribution $\mu$ of $Z$ is continuous, we get

$$\limsup_{n\to\infty} \mathbb{P}[Y_n/X_n > \epsilon] \le \lim_{n\to\infty} \mathbb{P}[Z_n \le 3\eta] = \mu([0,3\eta])$$

because as $n$ goes to infinity, the law of $Z_n$ converges weakly to $\mu$ and this probability gives no weight to the boundary $\{3\eta\}$ of $(-\infty, 3\eta]$. Using again Lemma 16 and letting $\eta$ go to zero, we obtain that $\lim_{n\to\infty} \mathbb{P}[Y_n/X_n > \epsilon] = 0$ and consequently the announced convergence in probability.



Remark 19 We don't know if $(A_n)_{n\in\mathbb{N}}$ converges to $\pi$ a.s. One way to deduce this result, via the Borel-Cantelli lemma, would be to show that for any given $\epsilon > 0$,

$$\sum_{n\in\mathbb{N}} \mathbb{P}[Y_n/X_n > \epsilon] < +\infty \qquad (30)$$

In view of the above arguments, one of the main problems is that we have no bound on the way $\mu([0,\eta])$ goes to zero as $\eta$ goes to zero. We would like to find $\alpha > 0$ such that $\limsup_{\eta\to 0_+} \mu([0,\eta])/\eta^{\alpha} < +\infty$, but we were not able to prove such an estimate. If we knew that $\mu$ is absolutely continuous,
Figure 7 in section 7 would suggest that this property holds with a = 1 (and lim^_s.o_|_ /^([O, ff\)lr\ < 

□

In order to prove Theorem 3 we need two technical results. In all that follows, let us fix some $a \in [0,1/2]$ and $\epsilon > 0$ and define $O := [a-\epsilon, a+\epsilon] \cap [0,1/2]$.

Lemma 20 There exist $\eta > 0$ and $N \in \mathbb{N}^*$ such that

$$\inf_{z\in[0,1/2]} \mathbb{P}_z[Z_N \in O] \ge \eta$$

(the index $z$ means that $Z_0 = z$).



Proof 

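This is a consequence of Lemma 13 applied to $M^2$: there exists $\rho \in (0,1)$ such that for any $z \in [0,1/2]$ and $n \in \mathbb{N}$, $D(M^n(z,\cdot), \mu) \le \rho^{\lfloor n/2\rfloor}/2$. Let $\varphi$ be the function vanishing outside $(a-\epsilon, a+\epsilon)$, affine on $[a-\epsilon, a]$ and $[a, a+\epsilon]$ such that $\varphi(a) = \epsilon$. By definition of $D$, we have

$$\forall\, z \in [0,1/2],\ \forall\, n \in \mathbb{N}, \qquad |M^n(z,\varphi) - \mu[\varphi]| \le \frac{\rho^{\lfloor n/2\rfloor}}{2}$$

Since the support of $\mu$ is $[0,1/2]$, we have $\eta := \mu[\varphi] > 0$. So if we choose $N \in \mathbb{N}$ such that $\rho^{\lfloor N/2\rfloor} \le \eta$, we get that for any $z \in [0,1/2]$, $\mathbb{P}_z[Z_N \in O] \ge M^N(z,\varphi) \ge \eta/2$.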

■ 

For the second technical result, we need further notation: for $(x,y) \in \mathcal{D}$ and $(i_1, i_2, \dots, i_N) \in \llbracket 1,6\rrbracket^N$, we denote by $x_{i_1,i_2,\dots,i_N}(x,y)$, $y_{i_1,i_2,\dots,i_N}(x,y)$ and $z_{i_1,i_2,\dots,i_N}(x)$ the values of $X_N$, $Y_N$ and $Z_N$ when $(X_0,Y_0,Z_0) = (x,y,x)$ and $(\iota_1,\iota_2,\dots,\iota_N) = (i_1,i_2,\dots,i_N)$.

Lemma 21 There exists a constant $K_N$ such that

$$\forall\, (i_1,i_2,\dots,i_N) \in \llbracket 1,6\rrbracket^N,\ \forall\, (x,y) \in \mathcal{D}, \qquad \begin{cases} |x_{i_1,i_2,\dots,i_N}(x,y) - z_{i_1,i_2,\dots,i_N}(x)| \le K_N\, y \\ |y_{i_1,i_2,\dots,i_N}(x,y)| \le K_N\, y \end{cases}$$

Proof

For $N = 1$ this is just Lemma 7. The general case is proven by an easy iteration, similar to the one already used in the proof of Lemma 18, starting with

$$x_{i_1,\dots,i_N}(x,y) - z_{i_1,\dots,i_N}(x) = x_{i_N}\big(x_{i_1,\dots,i_{N-1}}(x,y),\, y_{i_1,\dots,i_{N-1}}(x,y)\big) - z_{i_N}\big(z_{i_1,\dots,i_{N-1}}(x)\big)$$

We can now come to the
Proof of Theorem 3

Let $O' := [a-2\epsilon, a+2\epsilon] \cap [0,1/2]$. We want to show an analogous result to Lemma 20 but for the chain $(X_n,Y_n)_{n\in\mathbb{N}}$, namely to find $\eta' > 0$ and $N' \in \mathbb{N}^*$ such that

$$\inf_{(x,y)\in\mathcal{D}} \mathbb{P}_{(x,y)}[X_{N'} \in O'] \ge \eta' \qquad (31)$$

(let us recall that under $\mathbb{P}_{(x,y)}$, $(X_0,Y_0) = (x,y)$). To do so, we first consider $\eta$ and $N$ as in Lemma 20 and consider $\delta > 0$ sufficiently small such that $K_N \delta \le \epsilon$. Then according to Lemmas 20 and 21 we have

$$\inf_{(x,y)\in\mathcal{D}\,:\, y\le\delta} \mathbb{P}_{(x,y)}[X_N \in O'] \ge \eta$$

To extend this estimate to the whole domain P, we come back to ([T]) and ([S]) which enables us to 
find iV" G N such that 

$$\eta'' := \inf_{(x,y)\in\mathcal{D}} \mathbb{P}_{(x,y)}[Y_{N''} \le \delta] > 0$$

Now the Markov property implies (31) with $\eta' = \eta\,\eta''$ and $N' = N + N''$.

Since this bound is uniform over $(x,y) \in \mathcal{D}$, the sequence $(\mathbb{1}_{O'}(X_{nN'}))_{n\in\mathbb{N}}$ is stochastically bounded below by an independent family of Bernoulli variables of parameter $\eta'$ and we deduce that a.s.

$$\limsup_{n\to\infty} \mathbb{1}_{O'}(X_n) = 1$$

The announced result follows because $a \in [0,1/2]$ and $\epsilon > 0$ are arbitrary.

■ 

The details of the above proof are necessary because in general one cannot deduce from the convergence in probability of $|X_n - Z_n|$ to zero as $n$ goes to infinity the a.s. equality of the limit sets of $(X_n)_{n\in\mathbb{N}}$ and of $(Z_n)_{n\in\mathbb{N}}$. This property rather requires the a.s. convergence of $|X_n - Z_n|$ to zero and this leads to the following observations:

Remark 22 Coming back to Remark 19, to prove (30) via (29), we are also missing an estimate of the kind

$$\exists\, K, p, \chi > 0 \;:\; \forall\, n \in \mathbb{N}, \qquad \mathbb{E}[Y_n^p] \le K \exp(-\chi n) \qquad (32)$$

Blackwell [2] succeeded in obtaining such a bound (with $p = 1/2$) by exhibiting an appropriate supermartingale with the help of the computer, see also the survey by Butler and Graham [3]. His result can be seen to imply Theorem 1 with $\chi = 0.04$.

Furthermore it allows for a more direct proof of Theorem 3. Indeed, if (32) is satisfied for some $p > 0$, then for any $q > 0$,

$$\sum_{n\in\mathbb{N}} \mathbb{E}[Y_n^q] < +\infty$$

(this is immediate for $q = p$; use the Hölder inequality for $0 < q < p$ and the elementary bound $y^q \le (\sqrt{3}/2)^{q-p} y^p$ for $y \in [0, \sqrt{3}/2]$ and $q > p$). The arguments of the proof of Lemma 18 (especially (28) and a similar relation for odd integers) then show that

$$\sum_{n\in\mathbb{N}} \mathbb{E}[|X_n - Z_n|^q] < +\infty$$

and consequently that $|X_n - Z_n|$ converges a.s. to zero. This is sufficient to deduce that almost surely the limit set of $(X_n)_{n\in\mathbb{N}}$ coincides with that of $(Z_n)_{n\in\mathbb{N}}$, thus the law of large numbers for $Z$ and Lemma 17 imply Theorem 3.

In the same spirit, one can go further toward justifying the assertion made in the introduction that asymptotically $(X_n)_{n\in\mathbb{N}}$ is almost Markovian. Let us introduce the supremum distance $S$ on $[0,1/2]^{\mathbb{N}}$, seen as the set of trajectories from $\mathbb{N}$ to $[0,1/2]$:

$$\forall\, x := (x_n)_{n\in\mathbb{N}},\ z := (z_n)_{n\in\mathbb{N}} \in [0,1/2]^{\mathbb{N}}, \qquad S(x,z) := \sup_{n\in\mathbb{N}} |x_n - z_n|$$

For $m \in \mathbb{N}$, let $X_{[m,\infty[} := (X_{m+n})_{n\in\mathbb{N}} \in [0,1/2]^{\mathbb{N}}$ and consider

$$S_m := \inf \mathbb{E}[S(X_{[m,\infty[}, Z)]$$

where the infimum is taken over all couplings of $X_{[m,\infty[}$ with a Markov chain $Z$ whose transition kernel is $M$. Then we have $\lim_{m\to\infty} S_m = 0$. To be convinced of this convergence, consider for fixed $m \in \mathbb{N}$, $(\widetilde{X}_n,\widetilde{Y}_n)_{n\in\mathbb{N}}$ and $(\widetilde{Z}_n)_{n\in\mathbb{N}}$ two chains coupled as in the beginning of this section and starting from the initial conditions $(\widetilde{X}_0,\widetilde{Y}_0) = (X_m,Y_m)$ and $\widetilde{Z}_0 = X_m$. Then (32) and (28) imply that the quantity $\sum_{n\in\mathbb{N}} \mathbb{E}[|\widetilde{X}_n - \widetilde{Z}_n|]$ converges exponentially fast to zero as $m$ goes to infinity.

□



A Numerical computations

We present here the technical facts needed to conclude the proof of Proposition 8. We used the free numerical computational package Scilab 4.1.2. Next we investigate the Lipschitz constants alluded to in Remark 15. Finally we illustrate the results obtained in this paper by giving a numerical approximation of the invariant measure $\mu$ of the chain $Z$ and of the rate $L$ defined in (19).

But as a preliminary, let us give in Figure 1 the graphs of the six functions $z_i$, for $i \in \llbracket 1,6\rrbracket$, since they are at the heart of this paper.

Figure 1: Graphs of the $z_i$, for $i \in \llbracket 1,6\rrbracket$

A.1 Computations relative to Proposition 8

The goal is to show that the function $F$ defined in (13) does not vanish in $[0, 1/2]$. Since we want
to work formally as long as we can, it is better to consider the piecewise rational fraction R given 
in dH]). Here are the codes we wrote and the results given by the computer. 

• On the interval (2/7, 1/2). 

First we compute the matrix $A := \big(\sqrt{6}\,G(i, z_j(x))\,\sqrt{6}\,G(j,x)\big)_{i,j\in\llbracket 1,6\rrbracket}$, where $x$ is a "free variable". We multiplied the function $G$ by $\sqrt{6}$ to avoid square roots in the computations.

// entries sqrt(6)*G(i,z_j(x)) * sqrt(6)*G(j,x) on the interval (2/7, 1/2)
x=poly(0,'x');
z1=3/2*x/(1+x);
z2=(2-4*x)/(2-x);
z3=(2-4*x)/(3-3*x);
z4=(1+x)/(4-2*x); z5=(1-2*x)/(4-2*x); z6=(1-2*x)/3;
z=[z1 z2 z3 z4 z5 z6];
g1=2*(1+x); g2=(2-x); g3=3*(1-x);
g4=2*(2-x); g5=g4; g6=3;
g=[g1;g2;g3;g4;g5;g6];
a=horner(g,z);
A=a*diag(g)

Running this code, we get the following polynomial matrix:

$$A = \begin{pmatrix}
4+10x & 8-10x & 10-14x & 10-2x & 10-8x & 8-4x \\
4+x & 2+2x & 4-2x & 7-5x & 7-2x & 5+2x \\
6-3x & 9x & 3+3x & 9-9x & 9 & 6+6x \\
8+2x & 4+4x & 8-4x & 14-10x & 14-4x & 10+4x \\
8+2x & 4+4x & 8-4x & 14-10x & 14-4x & 10+4x \\
6+6x & 6-3x & 9-9x & 12-6x & 12-6x & 9
\end{pmatrix}$$
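The entries of this matrix can be rechecked without a computer algebra system, since each of them is affine in $x$. The Python sketch below is a cross-check in exact rational arithmetic, independent of the Scilab session; the encoding of $A$ as pairs (constant, slope) and the helper names are ours. It compares $g_i(z_j(x))\,g_j(x)$ with the displayed entries at a few points of $[2/7, 1/2]$ and evaluates the product of all entries divided by $6^{36}$ at $x = 1/2$.

```python
from fractions import Fraction as Fr

def z_vals(x):
    # z_1, ..., z_6 on the interval [2/7, 1/2] (second branches of z_2 and z_3)
    return [Fr(3, 2) * x / (1 + x), (2 - 4 * x) / (2 - x), (2 - 4 * x) / (3 - 3 * x),
            (1 + x) / (4 - 2 * x), (1 - 2 * x) / (4 - 2 * x), (1 - 2 * x) / 3]

def g_vals(t):
    # sqrt(6) * G(i, t) for i = 1, ..., 6, as in the Scilab listing
    return [2 * (1 + t), 2 - t, 3 * (1 - t), 2 * (2 - t), 2 * (2 - t), Fr(3)]

# (constant, slope) of each affine entry of the matrix A on [2/7, 1/2]
A = [[(4, 10), (8, -10), (10, -14), (10, -2), (10, -8), (8, -4)],
     [(4, 1), (2, 2), (4, -2), (7, -5), (7, -2), (5, 2)],
     [(6, -3), (0, 9), (3, 3), (9, -9), (9, 0), (6, 6)],
     [(8, 2), (4, 4), (8, -4), (14, -10), (14, -4), (10, 4)],
     [(8, 2), (4, 4), (8, -4), (14, -10), (14, -4), (10, 4)],
     [(6, 6), (6, -3), (9, -9), (12, -6), (12, -6), (9, 0)]]

def entry(i, j, x):
    # g_i(z_j(x)) * g_j(x), computed exactly (indices start at 0 here)
    return g_vals(z_vals(x)[j])[i] * g_vals(x)[j]

def R(x):
    # product of the 36 entries divided by 6^36
    prod = Fr(1)
    for i in range(6):
        for j in range(6):
            prod *= entry(i, j, x)
    return prod / Fr(6) ** 36

mismatches = [(i, j, x) for x in (Fr(2, 7), Fr(1, 3), Fr(2, 5), Fr(1, 2))
              for i in range(6) for j in range(6)
              if entry(i, j, x) != A[i][j][0] + A[i][j][1] * x]
print(mismatches, float(R(Fr(1, 2))))
```

Two evaluation points would already determine an affine entry; the extra points are only a safeguard against transcription slips.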



By construction, we have

$$R(x) = \frac{1}{6^{36}} \prod_{i,j\in\llbracket 1,6\rrbracket} A_{i,j}(x) = \frac{2^{29}3^{17}}{6^{36}} \prod_{i,j\in\llbracket 1,6\rrbracket} B_{i,j}(x)$$

where the matrix $B$ is defined by

$$B := \begin{pmatrix}
2+5x & 4-5x & 5-7x & 5-x & 5-4x & 2-x \\
4+x & 1+x & 2-x & 7-5x & 7-2x & 5+2x \\
2-x & x & 1+x & 1-x & 1 & 1+x \\
4+x & 1+x & 2-x & 7-5x & 7-2x & 5+2x \\
4+x & 1+x & 2-x & 7-5x & 7-2x & 5+2x \\
1+x & 2-x & 1-x & 2-x & 2-x & 1
\end{pmatrix}$$

thus we obtain that $R(x) - 1 = P_1(x)$ on $[2/7, 1/2]$, with

$$P_1(x) := 2^{-7}3^{-19}\,(2+5x)(4-5x)(5-7x)(5-x)(5-4x)(2-x)^8(4+x)^3(1+x)^6(7-5x)^3(7-2x)^3(5+2x)^3\,x\,(1-x)^2 - 1$$



Using the command roots of Scilab 4.1.2, which computes numerically the roots of a given polynomial function, we obtained the roots of $P_1(x)$ depicted as crosses in Figure 2.

Figure 2: Roots of $P_1(x)$

The segment $[2/7, 1/2]$ is drawn as the thick horizontal segment in the middle of the picture and it appears (by zooming) that none of the roots are on it. Indeed looking at the roots found by the computer, the one on the left (respectively the right) of the segment is 0.0006183 (resp. 0.6297289). The fact that $R > 1$ on $[2/7, 1/2]$ comes from the computation of $R(1/2) \approx 12.989284$. We also estimate that $R(2/7) \approx 99.311045$.

• On the interval (1/5, 2/7).

The first code above has only to be slightly modified: the definition of z2 (line 4) is now

z2=(3*x)/(2-x);

The corresponding matrix $A$ is also polynomial and the computer gives us

$$A = \begin{pmatrix}
4+10x & 4+4x & 10-14x & 10-2x & 10-8x & 8-4x \\
4+x & 4-5x & 4-2x & 7-5x & 7-2x & 5+2x \\
6-3x & 6-12x & 3+3x & 9-9x & 9 & 6+6x \\
8+2x & 8-10x & 8-4x & 14-10x & 14-4x & 10+4x \\
8+2x & 8-10x & 8-4x & 14-10x & 14-4x & 10+4x \\
6+6x & 6-3x & 9-9x & 12-6x & 12-6x & 9
\end{pmatrix}$$

Then we compute the roots of the polynomial function $P_2(x)$ representing $R(x) - 1$ on $[1/5, 2/7]$. As can be seen in Figure 3, none of them belong to this segment (drawn as a thick line; the closest root is 0.4569663).

Thus $R - 1$ is strictly positive on $[1/5, 2/7]$, since we already knew that $R(2/7) > 1$. We estimate that $R(1/5) \approx 418.66239$.

• On the interval (0, 1/5).

The additional modification is the definition of z3 (line 5),

z3=(1+x)/(3-3*x);

Figure 3: Roots of $P_2(x)$

Again we obtain a polynomial matrix $A$,

$$A = \begin{pmatrix}
4+10x & 4+4x & 8-4x & 10-2x & 10-8x & 8-4x \\
4+x & 4-5x & 5-7x & 7-5x & 7-2x & 5+2x \\
6-3x & 6-12x & 6-12x & 9-9x & 9 & 6+6x \\
8+2x & 8-10x & 10-14x & 14-10x & 14-4x & 10+4x \\
8+2x & 8-10x & 10-14x & 14-10x & 14-4x & 10+4x \\
6+6x & 6-3x & 9-9x & 12-6x & 12-6x & 9
\end{pmatrix}$$

and Figure 4 gives a picture of the roots of $P_3(x)$ representing $R(x) - 1$ on $[0, 1/5]$. None of them belong to this segment (the closest one is 0.3995404).

Figure 4: Roots of $P_3(x)$

Thus $R - 1$ is also strictly positive on $[0, 1/5]$, since we have already seen that $R(1/5) > 1$. We estimate that $R(0) \approx 13496.561$.

In conclusion, the function $F$ defined in (13) is positive on $[0, 1/2]$; it is illustrated in Figure 5.

Figure 5: Graph of function $F$
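The values of $R$ quoted on the three intervals can be recomputed exactly from the definitions. The Python sketch below is an independent cross-check, not the Scilab session of the paper; the helper names and the grid of step 1/200 are our choices. It evaluates $R$ in rational arithmetic, using the appropriate branch of $z_2$ and $z_3$ at each point.

```python
from fractions import Fraction as Fr

def z_vals(x):
    # z_1, ..., z_6 at a rational point x of [0, 1/2]
    z2 = 3 * x / (2 - x) if x < Fr(2, 7) else (2 - 4 * x) / (2 - x)
    z3 = (1 + x) / (3 - 3 * x) if x < Fr(1, 5) else (2 - 4 * x) / (3 - 3 * x)
    return [Fr(3, 2) * x / (1 + x), z2, z3,
            (1 + x) / (4 - 2 * x), (1 - 2 * x) / (4 - 2 * x), (1 - 2 * x) / 3]

def g_vals(t):
    # sqrt(6) * G(i, t) for i = 1, ..., 6, as in the Scilab listing
    return [2 * (1 + t), 2 - t, 3 * (1 - t), 2 * (2 - t), 2 * (2 - t), Fr(3)]

def R(x):
    """Product over (i, j) of g_i(z_j(x)) * g_j(x), divided by 6^36."""
    x = Fr(x)
    prod = Fr(1)
    for zj, gj in zip(z_vals(x), g_vals(x)):
        for gi in g_vals(zj):
            prod *= gi * gj
    return prod / Fr(6) ** 36

for x in (Fr(0), Fr(1, 5), Fr(2, 7), Fr(1, 2)):
    print(x, float(R(x)))

grid_min = min(R(Fr(k, 200)) for k in range(101))
print(float(grid_min))
```

In closed form this gives $R(0) = 2^{10}5^{8}7^{6}/3^{20}$ and $R(1/2) = 3^{20}/2^{28}$, in agreement with the decimal values above.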

There is a more analytical approach to justify the positivity of $F$. Indeed, the above polynomial expressions for the three matrices $A$ (one on each of the segments $[0, 1/5]$, $[1/5, 2/7]$ and $[2/7, 1/2]$) give a simple uniform bound $C$ on $|F'|$ on $(0, 1/2) \setminus \{1/5, 2/7\}$ via the formula

$$\forall\, x \in (0,1/2) \setminus \{1/5, 2/7\}, \qquad |F'(x)| = \frac{1}{36}\left|\sum_{i,j\in\llbracket 1,6\rrbracket} \frac{A_{i,j}'(x)}{A_{i,j}(x)}\right| \le \max_{i,j\in\llbracket 1,6\rrbracket} \left|\frac{A_{i,j}'(x)}{A_{i,j}(x)}\right|$$

Working out the details, we get that we can take $C = 14/3$ (obtained for $i = 3$, $j = 2$ and $x = 2/7$ for the matrix $A$ valid on the interval $[1/5, 2/7]$). Thus if there exists $N \in \mathbb{N}^*$ such that

$$\min_{i\in\llbracket 0, \lfloor N/2\rfloor\rrbracket} F(i/N) - \frac{14}{6N} > 0$$

we can conclude to the positivity of $F$ on $[0, 1/2]$. We tried with $N = \lceil 14/(3F(1/2)) \rceil = 66$ and it works (except that we have not taken into account the precision of the computer).
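The mesh argument can be replayed in a few lines. The Python sketch below works under two assumptions that should be kept in mind: $F$ is taken to be $\frac{1}{36}\ln R$, which is the normalization consistent with the derivative formula above and with the value $N = 66$, and the mesh is taken to be $\{i/N : 0 \le i \le N/2\}$. Like the original computation, it ignores floating point errors. The helper names are ours.

```python
import math

def z_vals(x):
    # z_1, ..., z_6 at a point x of [0, 1/2]
    z2 = 3 * x / (2 - x) if x < 2 / 7 else (2 - 4 * x) / (2 - x)
    z3 = (1 + x) / (3 - 3 * x) if x < 1 / 5 else (2 - 4 * x) / (3 - 3 * x)
    return [1.5 * x / (1 + x), z2, z3,
            (1 + x) / (4 - 2 * x), (1 - 2 * x) / (4 - 2 * x), (1 - 2 * x) / 3]

def g_vals(t):
    # sqrt(6) * G(i, t) for i = 1, ..., 6
    return [2 * (1 + t), 2 - t, 3 * (1 - t), 2 * (2 - t), 2 * (2 - t), 3.0]

def F(x):
    """(1/36) * ln R(x), with R the product of the A_ij(x) divided by 6^36 (assumed normalization)."""
    total = 0.0
    for zj, gj in zip(z_vals(x), g_vals(x)):
        for gi in g_vals(zj):
            total += math.log(gi * gj / 6.0)
    return total / 36.0

C = 14.0 / 3.0                       # bound on |F'| quoted in the text
N = math.ceil(C / F(0.5))            # the choice N = ceil(14 / (3 F(1/2)))
mesh_min = min(F(i / N) for i in range(N // 2 + 1))
margin = mesh_min - C / (2 * N)      # equals min F(i/N) - 14/(6N)
print(N, mesh_min, margin)
```

A positive `margin` means that $F$ cannot reach zero between two consecutive mesh points, since it moves by at most $C/(2N)$ from the nearest one.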

The fact that $F$ and $R$ are decreasing on $[0,1/2]$ can also be checked numerically. Indeed, using the command derivat of Scilab 4.1.2, we can compute first the derivatives of the polynomial functions $P_1$, $P_2$ and $P_3$ and next their roots. It appears that they do not belong respectively to the segments $[2/7,1/2]$, $[1/5,2/7]$ and $[0,1/5]$. Since we have already seen that $F(0) > F(1/5) > F(2/7) > F(1/2)$, it follows that $F$ is decreasing on $[0, 1/2]$ and thus bounded below by $F(1/2)$.

As suggested to us by Andre Galligo, there is also a totally algebraic approach to this problem via the use of Sturm's sequences. Since we are dealing with polynomial functions with rational coefficients, this method can be made rigorous, if we are allowed to work on the computer with numbers with a large number of digits (here of order $42 = \lceil \log_{10}(36!) \rceil$, since we have to consider the values of all the derivatives of the polynomial function of interest at the boundary of the interval where we are looking for roots), in order to deal only with true "integers". So this approach would require working with mathematical software such as Maple.

Finally let us also draw the graph of the function

$$\widetilde{F} : [0,1/2] \ni x \mapsto \frac16 \sum_{i\in\llbracket 1,6\rrbracket} \ln(G(i,x))$$

to show that it was necessary to iterate the kernel $M$ to get a lower bound as the one in Proposition 8 (or to be able to apply the result of Barnsley and Elton [1]).

Figure 6: Graph of function $\widetilde{F}$

Indeed, $\widetilde{F}$ is negative in a neighborhood of 1/2, which means that the isoperimetric values of the triangles near this neighborhood have a tendency to decrease. But fortunately, the abscissa of the characteristic point of the new triangle obtained by the barycentric subdivision has a chance of 4 over 6 (see Figure 1) to be close to zero, a zone where the isoperimetric functional has a strong tendency to increase.



A.2 On the Lipschitz constants of Remark 15

We implement on the computer the verification of the criterion (27) from Diaconis and Freedman [5] in our particular situation. We adopt an analytical approach, despite the fact we are essentially dealing with polynomial functions. Let us denote (recall the notation of (11))

$$\forall\, i \in \llbracket 1,6\rrbracket,\ \forall\, x \in [0,1/2], \qquad D(i,x) := |z_i'|(x) = \frac{1}{G^2(i,x)}$$

and for any $n \in \mathbb{N}^*$ and $(i,j) \in \llbracket 1,6\rrbracket^2$,

$$K_{i,j}(n) := \sup_{x\in S_n} D(i,z_j(x))\,D(j,x)$$

where $S_n := \llbracket 0,n\rrbracket/(2n)$ is a mesh of $[0,1/2]$. Then for any $(i,j) \in \llbracket 1,6\rrbracket^2$, the Lipschitz constant of $z_i \circ z_j$ is given by

$$K_{i,j} = \sup_{x\in[0,1/2]} D(i,z_j(x))\,D(j,x) = \sup_{n\in\mathbb{N}^*} K_{i,j}(n)$$

Thus to show that criterion (27) is not satisfied by $M^2$, it is sufficient to check that for some $n \in \mathbb{N}^*$, $\mathcal{K}_2(n) := \prod_{(i,j)\in\llbracket 1,6\rrbracket^2} K_{i,j}(n) > 1$.

Similarly for $M^3$, we define for any $n \in \mathbb{N}^*$ and any $(i,j,k) \in \llbracket 1,6\rrbracket^3$,

$$K_{i,j,k}(n) := \sup_{x\in S_n} D(i,z_j\circ z_k(x))\,D(j,z_k(x))\,D(k,x)$$

so that for any $(i,j,k) \in \llbracket 1,6\rrbracket^3$, the Lipschitz constant of $z_i \circ z_j \circ z_k$ is given by $K_{i,j,k} = \sup_{n\in\mathbb{N}^*} K_{i,j,k}(n)$. We compute that

$$\max_{i\in\llbracket 1,6\rrbracket,\, x\in[0,1/2]} D(i,x) = \frac83, \qquad \max_{i\in\llbracket 1,6\rrbracket,\, x\in[0,1/2]} \left|\frac{\partial D(i,x)}{\partial x}\right| = \frac{32}{3}$$

so we easily deduce that

$$\max_{(i,j,k)\in\llbracket 1,6\rrbracket^3,\, x\in[0,1/2]} \left|\frac{\partial}{\partial x}\, D(i,z_j\circ z_k(x))\,D(j,z_k(x))\,D(k,x)\right| \le \frac{195552}{243} \le 805$$

It follows that

$$\forall\, n \in \mathbb{N}^*,\ \forall\, (i,j,k) \in \llbracket 1,6\rrbracket^3, \qquad K_{i,j,k} \le K_{i,j,k}(n) + \frac{202}{n}$$

Thus to see that criterion (27) is satisfied by $M^3$, it is sufficient to check that for some $n \in \mathbb{N}^*$,

$$\mathcal{K}_3(n) := \prod_{(i,j,k)\in\llbracket 1,6\rrbracket^3} \left(K_{i,j,k}(n) + \frac{202}{n}\right) < 1$$

The following program (where the index set $\llbracket 1,6\rrbracket$ has been replaced by the more convenient set $\llbracket 0,5\rrbracket$) gives that $\mathcal{K}_2(1)$ = k2 $\approx 92.067225$ and $\mathcal{K}_3(10000)$ = k3 $\approx 0.0116774$, values which justify the assertions made in Remark 15.

function z=Z(x,i)
  // the maps z_1,...,z_6, indexed here by i = 0,...,5
  select i
  case 0 then z=3/2*x/(1+x);
  case 1 then z=3*bool2s((x<2/7))*x/(2-x)+bool2s((x>=2/7))*(2-4*x)/(2-x);
  case 2 then z=bool2s((x<1/5)).*(1+x)./(3-3*x)+bool2s((x>=1/5)).*(2-4*x)./(3-3*x);
  case 3 then z=(1+x)/(4-2*x);
  case 4 then z=(1-2*x)/(4-2*x);
  case 5 then z=(1-2*x)/3;
  end;
endfunction

function d=D(x,i)
  // absolute derivative of the map with index i at the point x
  select i
  case 0 then d=3/2 ./(1+x).^2;
  case 1 then d=6 ./(2-x).^2;
  case 2 then d=2/3 ./(1-x).^2;
  case 3 then d=3/2 ./(2-x).^2;
  case 4 then d=3/2 ./(2-x).^2;
  case 5 then d=2/3;
  end;
endfunction

function d=DD(x,i)
  // chain rule for the composition with indices (j,k), where i = 6*j + k
  j=floor(i/6); k=i-6*j;
  d=D(Z(x,k),j)*D(x,k);
endfunction

function d=DDD(x,i)
  // chain rule for the composition with indices (j,k,l), where i = 36*j + 6*k + l,
  // plus the mesh correction 202/n for n = 10000
  j=floor(i/36); k=floor((i-36*j)/6); l=i-36*j-6*k;
  d=D(Z(Z(x,l),k),j)*D(Z(x,l),k)*D(x,l)+202/10000;
endfunction

// k2: product over the indices 0:35 of the column maxima of DDD on the mesh {0, 1/2}
r2=[0 1/2]; I2=0:35; K2=feval(r2,I2,DDD);
k2=prod(max(K2,'r'))

// k3: product over the indices 0:215 of the column maxima of DDD on the mesh of step 1/20000
r3=0:1/(2*10000):.5; I3=0:215; K3=feval(r3,I3,DDD);
k3=prod(max(K3,'r'))
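As a complement, the quantity $\mathcal{K}_2(n)$ can be evaluated directly from its definition, that is with the chain-rule product $D(i,z_j(x))\,D(j,x)$ maximized over the mesh $S_n$. The Python sketch below does this; it is our own transcription and is not meant to reproduce the figures printed by the Scilab program. The mesh size $n = 7$ is our choice (it makes the kink $2/7$ of $z_2$ a mesh point), and the indices run over $0,\dots,5$ as in the listing.

```python
def Z(x, i):
    """The maps z_1, ..., z_6, indexed by i = 0, ..., 5 as in the Scilab listing."""
    if i == 0:
        return 1.5 * x / (1 + x)
    if i == 1:
        return 3 * x / (2 - x) if x < 2 / 7 else (2 - 4 * x) / (2 - x)
    if i == 2:
        return (1 + x) / (3 - 3 * x) if x < 1 / 5 else (2 - 4 * x) / (3 - 3 * x)
    if i == 3:
        return (1 + x) / (4 - 2 * x)
    if i == 4:
        return (1 - 2 * x) / (4 - 2 * x)
    return (1 - 2 * x) / 3

def D(x, i):
    """Absolute derivative of the map with index i at the point x."""
    if i == 0:
        return 1.5 / (1 + x) ** 2
    if i == 1:
        return 6 / (2 - x) ** 2
    if i == 2:
        return 2 / (3 * (1 - x) ** 2)
    if i in (3, 4):
        return 1.5 / (2 - x) ** 2
    return 2 / 3

def K2(n):
    """Product over (i, j) of the maximum over S_n = {k/(2n)} of D(z_j(x), i) * D(x, j)."""
    mesh = [k / (2 * n) for k in range(n + 1)]
    prod = 1.0
    for i in range(6):
        for j in range(6):
            prod *= max(D(Z(x, j), i) * D(x, j) for x in mesh)
    return prod

# largest one-step derivative on a fine grid (attained at x = 1/2)
max_D = max(D(k / 200, i) for i in range(6) for k in range(101))
print(max_D, K2(7))
```

Since each factor is a maximum over mesh points, it is a lower bound of the corresponding Lipschitz constant $K_{i,j}$, so a product above 1 on some mesh is enough to see that (27) fails for $M^2$.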

A.3 Approximation of the invariant measure $\mu$

Proposition 14 leads to an approximation of the invariant measure $\mu$ through the strong law of large numbers. In practice, we run the following code, which simulates the chain $(Z_n)_{n\in\llbracket 1,N\rrbracket}$ with $N = 100000$ starting from an initial variable $Z_1$ chosen uniformly over $[0, 1/2]$.

n=100000;
z(1)=.5*rand(); s(1)=0;
for i=2:n
  j=floor(6*rand());
  z(i)=Z(z(i-1),j);
  s(i)=s(i-1)-log(D(z(i-1),j))/2;
end
L=s(n)/n



The program takes into account the functions Z and D defined in the code of the previous subsection. Figure 7 shows the approximation we got of $\mu$ through a histogram using 100 bars.

Finally, the code provides an approximation of the rate $L$ defined in (19). Running the program several times, we always got values between 0.07 and 0.08, suggesting that Theorem 1 should be satisfied with $\chi = 0.07$.
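For readers without Scilab, the same simulation can be sketched in Python. This is a transcription under our own naming (`Z`, `D`, `estimate_rate`), with a fixed seed for reproducibility; the random number generator differs from the one of Scilab, so individual runs will not coincide with those of the paper.

```python
import math
import random

def Z(x, i):
    """The maps z_1, ..., z_6, indexed by i = 0, ..., 5 as in the Scilab listing."""
    if i == 0:
        return 1.5 * x / (1 + x)
    if i == 1:
        return 3 * x / (2 - x) if x < 2 / 7 else (2 - 4 * x) / (2 - x)
    if i == 2:
        return (1 + x) / (3 - 3 * x) if x < 1 / 5 else (2 - 4 * x) / (3 - 3 * x)
    if i == 3:
        return (1 + x) / (4 - 2 * x)
    if i == 4:
        return (1 - 2 * x) / (4 - 2 * x)
    return (1 - 2 * x) / 3

def D(x, i):
    """Absolute derivative of the map with index i at the point x."""
    if i == 0:
        return 1.5 / (1 + x) ** 2
    if i == 1:
        return 6 / (2 - x) ** 2
    if i == 2:
        return 2 / (3 * (1 - x) ** 2)
    if i in (3, 4):
        return 1.5 / (2 - x) ** 2
    return 2 / 3

def estimate_rate(n, seed=0):
    """Simulate Z_1, ..., Z_n and return s(n)/n, with s as in the Scilab loop."""
    rng = random.Random(seed)
    current = 0.5 * rng.random()      # Z_1 uniform over [0, 1/2]
    s = 0.0
    for _ in range(n - 1):
        j = rng.randrange(6)          # uniform index in {0, ..., 5}
        s -= math.log(D(current, j)) / 2
        current = Z(current, j)
    return s / n

print(estimate_rate(100000))
```

Changing `seed` plays the role of "running the program several times" in the text above.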